-C4 



LAW OFFICES 

FiNNEGAN, Henderson, 
Farabow, Garrett 
% dunner,l.l.p, 

STANFORD RESEARCH PARK 
70 O HANSEN WAY 
PALO ALTO^ CALIF. 94304 
650-649-6600 



A 



PATENT 

Attorney's Docket Number: 4121.0116-07 



IN THE UNITED STATES PATENT AND TRADEMARK OFFICE ^ 



ASSISTANT COMMISSIONER FOR PATENTS 
Washington, D.C. 20231 



(DC 

0 

■n 



CERTIFICATE OF EXPRESS MAILING 
I hereby certify that this correspondence is being deposited 
with! the United States Postal Service's "Express Mail Post 
Office to Addressee" service under 37 CFR § 1 .10, in an 
envelope addressed to: BOX PATENT APPLICATION, 
Assistant Commissioner For Patents, Washington, D.C. 
20231, on August 4. 2000 . 

Express Mail Label No. EL 589694137 IjS 



Signed: rX/Jf^}//^) 



Prior Application: 



Art Unit: 1656 
Examiner: S. Houtteman 



Linda Phillips 7^ 



SIR: This is a request for filing a Divisional Application under 37 C.F.R. § 1.53(b) of 
pending prior application Serial No. 08/957.709 . filed October 24. 1997 of Hollv 
Hoqrefe et al., for POLYMERASE ENHANCING FACTOR (PEF) EXTRACTS PEF 
PROTEIN COMPLEXES, ISOLATED PEF PROTEINS, AND METHODS FOR 
PURIFYING AND IDENTIFYING SAME. 



1. 



2. □ 

3. ■ 

4. □ 



Enclosed is a complete copy of the prior application including the oath or 
Declaration and drawings, if any, as originally filed. On infonnation and belief, 
I hereby verify that the attached papers are true copies of prior application 
Serial No. 08/957.709 as originally filed on October 24. 1997 . and of the 
Declaration filed March 9, 1998. 

Enclosed is a substitute specification under 37 C.F.R. § 1.125. 
Cancel Claims 1 to 39. 45 to 68. and 75 to 94 . 
A Preliminary Amendment is enclosed. 

The filing fee is calculated on the basis of the claims existing in the prior 
application as amended at 3 and 4 above. 
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Total Claims 


11 




20 




x$18 




Independent Claims 
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3 




x$78 




[X] Presentation of Multiple Dep. Claim(s) 


+$260 


260.00 


Subtotal 


$ 950.00 


Reduction by 1/2 if small entity 


.00 


TOTAL APPLICATION FILING FEE 


$ 475.00 
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12. 



Please do not charge the filing fee to Deposit Account No. 06-0916. 

The Commissioner is hereby authorized to charge any fees which may be 
required including fees due under 37 C.F.R. § 1.16 and any other fees due 
under 37 C.F.R. § 1 .1 7, or credit any overpayment during the pendency of 
this application to Deposit Account No. 06-091 6. 

Amend the specification at page 1 , after line 2, by inserting the following 
paragraph: 

-This application is a division of application Serial No. 08/957,709, filed 
October 24, 1997, which is incorporated by reference for any purpose.-- 

New formal drawings are enclosed. 

The prior application is assigned of record to: Strataaene. 



1 101 1 North Torrev Pines Road. La Jolla. California 92037 



11. □ Priority of application Serial No. 



filed on 



in 



(country) Is claimed under 35 U.S.C. § 1 19. A certified copy 

□ is enclosed or □ is on file in the prior application. 
A verified statement claiming small entity status 

□ is enclosed or ■ is on file in the prior application. 
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16. □ 



The power of attorney in the prior application is to at least one of the 
following: FINNEGAN, HENDERSON, FARABOW, GARRETT & DUNNER, 
L.L.P., Douglas B. Henderson, Reg. No. 20,291; Ford F. Farabow, Jr., Reg. 
No. 20,630; Arthur S. Gan-ett, Reg. No. 20,338; Donald R. Dunner, Reg. 
No. 19,073; Brian G. Brunsvold, Reg. No. 22,593; Tipton D. Jennings, IV, 
Reg. No. 20,645; Jeriy D. Voight, Reg. No. 23,020; Laurence R. Hefter, Reg. 
No. 20,827; Kenneth E. Payne, Reg. No. 23,098; Herbert H. Mintz, Reg. 
No. 26,691; C. Larry O'Rourke, Reg. No. 26,014; Albert J. Santorelli, Reg. 
No. 22,610; Michael C. Elmer, Reg. No. 25,857; Richard H. Smith, Reg. 
No. 20,609; Stephen L. Peterson, Reg. No. 26,325; John M. Romany, Reg. 
No. 26,331; Bruce C. Zotter, Reg. No. 27,680; Dennis P. O'Reilley, Reg. 
No. 27,932; Allen M. Sokal, Reg. No. 26,695; Robert D. Bajefsky, Reg. 
No. 25,387; Richard L. Stroup, Reg. No. 28,478; David W. Hill, Reg. 
No. 28,220; Thomas L. Irving, Reg. No. 28,619; Charles E. Lipsey, Reg. 
No. 28,165; Thomas W. Winland, Reg. No. 27,605; Basil J. Lewris, Reg. 
No. 28,818; Martin I. Fuchs, Reg. No. 28,508; E. Robert Yoches, Reg. 
No. 30,120; Barry W. Graham, Reg. No. 29,924; Susan Habennan Griffen, 
Reg. No. 30,907; Richard B. Racine, Reg. No. 30,415; Thomas H. Jenkins, 
Reg. No. 30,857; Robert E. Converse, Jr., Reg. No. 27,432; Clair X. Mullen, 
Jr., Reg. No. 20,348; Christopher P. Foley, Reg. No. 31,354; John C. Paul, 
Reg. No. 30,413; David M. Kelly, Reg. No. 30,953; Kenneth J. Meyers, Reg. 
No. 25,146; Carol P. Einaudi, Reg. No. 32,220; Walter Y. Boyd, Jr., Reg. 
No. 31,738; Steven M. Anzalone, Reg. No. 32,095; Jean B. Fordis, Reg. 
No. 32,984; Roger D. Taylor, Reg. 28,992; Barbara C. McCurdy, Reg. 
No. 32,120; James K. Hammond, Reg. No. 31,964; Richard V. Burgujian, 
Reg. No. 31,744; J. Michael Jakes, Reg. No. 32,824; Thomas W. Banks, 
Reg. No. 32,719; Christopher P. Isaac, Reg. No. 32,616; Bryan C. Diner, 
Reg. No. 32,409; M. F'aul Barker, Reg. No. 32,013; Andrew Chanho Sonu, 
Reg. No. 33,457; David S. Forman, Reg. No. 33,694; Vincent P. Kovaiick, 
Reg. No. 32,867; James W. Edmondson, Reg. No. 33,871 ; Michael R. 
McGurk, Reg. No. 32,045; Joann M. Neth, Reg. No. 36,363; Gerson S. 
Panitch, Reg. No. 33,751; Cherl M. Taylor, Reg. No. 33,216; Charles E. Van 
Horn, Reg. No. 40,266; Linda A. Wadler, Reg. No. 33,218; Jeffrey A. 
Berkowitz, Reg. No. 36,743; Michael R. Kelly, Reg. No. 33, 921; and James 
B. Monroe, Reg. No. 33,971; Doris Johnson Hines, Reg. No. 34,629; Allen 
R. Jensen, Reg. No. 28,224; Lori Ann Johnson, Reg. No. 34,498; and David 
A. Manspeizer, Reg. No. 37,540. 

The power appears in the original declaration of the prior application. 

Since the power does not appear in the original declaration, a copy of the 
power in the prior application is enclosed. 



!0 



Page 4 of 4 



17. ■ Please address all correspondence to FINNEGAN, HENDERSON, 

FARABOW, GARRETT and DUNNER, L.L.P., 1300 I Street, N.W., 
Washington, D.C. 20005-3315. 

18. □ Recognize as associate attorney 



1 9. □ Also enclosed is 



PETITION FOR EXTENSION . If any extension of time is necessary for the filing of this 
application, including any extension in the parent application, serial no. 08/957.709 . 
filed October 24. 1997 . for the purpose of maintaining copendency between the parent 
application and this application, and such extension has not otherwise been requested, 
such an extension is hereby requested, and the Commissioner is authorized to charge 
necessary fees for such an extension to our Deposit Account No. 06-0916. A duplicate 
copy of this paper is enclosed for use in charging the deposit account. 
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Polymerase Enhancing Factor (PEF) Extracts, PEF Protein Complexes, 
Isolated PEF Proteins, and Methods for Purifying and Identifying Them 



Related Apolication Information 

This application claims priority to and is a continuationHn-part of U.S. Patent application serial 
no. 08/822,774, filed March 21, 1997. The entire contents of that application are specifically 
incorporated herein by reference and may be relied on to make and use embodiments of the 
claimed invention. 

Background of the Invention 

Field of the Invention 

The invention relates to the fields of nucleic acid polymerases and nucleic acid 
polymerization reactions. 

Introduction 

The efficiency of a nucleic acid polymerization reaction has implications for numerous assays 
and techniques. For example, the ability to enhance polymerase activity in a PCR process 
increases the sensitivity of the PCR-based assay. We have identified, produced, purified, and 
analyzed novel extracts, proteins, and complexes that improve the polymerization activity of nucleic 
acid polymerases. Included within the aspects of the present invention are methods for identifying 
compositions with a polymerase enhancing activity, methods for purifying and using these 
compositions, and specific extracts, proteins, and complexes that function to enhance polymerase 
activity. 

Description of Related Art 

Manipulating nucleic acids with polymerization reactions is a fundamental component of 
biotechnology-related research. These reactions permit researchers to replicate DNA or RNA in 
vitro, which in turn allows cloning or amplification of specific nucleic acids or groups of nucleic acids. 
Numerous other examples exist detailing the critical nature of a nucleic acid polymerization reaction 
or a nucleic acid polymerization enzyme in a particular technique, including sequencing nucleic 
acids, mutagenesis of nucleic acid sequences, and producing nucleic acid probes for hybridization. 
Of particular cun^ent interest are amplification reactions, such as PCR, that have greatly increased 
the rate at which researchers can perfomi nucleic acid related experimentation. Extremely rare 
nucleic acids can now be amplified and manipulated using these techniques, which necessarily 
involve nucleic acid polymerases. 

Using techniques with an amplification step has driven concem for the efficiency, fidelity, and 



sensitivity of the polymerase used. This has resulted in efforts to both analyze and optimize 
polymerization conditions for a variety of applications. (Lundberg et al., Gene 108; 1-6 (1991); 
Eckert and Kunkel, PGR Methods Applic. 1: 17-24 (1991); Ling et at., PGR Methods Appiic. 1: 63-69 
(1991); Brail et al., Mutat. Res, 303: 75-82 (1994); Canity and Wold, P.NAS. 89: 1021-1025 (1992); 
Taylor and Logan, Gun-. Opin. Biotechnol. 6: 24-29 (1995)) In particular, quantitative amplification- 
based reactions rely upon the ability to efficiently amplify each nucleic acid species present in a 
sample. (See Ausubel, et al., Ghapter 15, In: Gurrent Protocols in Molecular Biology, John Wiley & 
Sons (1992) and supplements through 1995.) Thus, both a concern for the accuracy of and a need 
for new methods to enhance the performance of amplification-based nucleic acid techniques exists 
in the art. 

One way in which these concerns and needs have been addressed is through the use of 
additives to the amplification reaction. Different additives act at different points in the amplification 
process. For example, formamide has been used to increase the specificity of PGR with GG rich 
target sequences, which are particularly susceptible to intramolecular hybridization that may prevent 
hybridization with a primer. (Sari^ar, G. el: al. Nucl. Acids Res. 18: 7465 (1990)). It has also been 
reported that tetramethylammonium chloride increases yield and specificity of PGR reactions. 
(Ghevet, E., et. a!., Nucleic Acids Res, 23:3343-3334 (1995).) Hung et al. report the reduction in 
multiple satellite bands from amplifying complex DNA when dimethyl sulfoxide (DMSO) is added. 
(Hung, T., et al. Nucl. Acids Res. 18: 4953(1990).) The multiple satellite bands often present 
problems in purifying the desired amplification product from the other DNA present. 

Gertain proteins have been used to stabilize hybridized nucleic acids during replication. For 
example, E. coli single-stranded DNA binding protein has been used to increase the yield and 
specificity of primer extension reactions and PGR reactions. (U. S. Patent Nos. 5,449,603 and 
5,534,407.) The gene 32 protein (single stranded DNA binding protein) of phage T4 apparently 
improves the ability to amplify larger DNA fragments (Schwartz, et al., Nucl. Acids Res. 18: 1079 
(1990)) and enhances DNA polymerase fiidelity (Huang. DNA Gell. Biol. 15: 589-594 (1996)). In 
addition, bacterial thioredoxin combined with T7 DNA polymerase (Sequenase™; Amersham-USB) 
has been used,to increase processivity, but the combination is not active at high temperatures, such 
as those used in PGR. 

Another way amplification-based assays and techniques have been improved is through the 
development of modified polymerases or the use of combinations of polymerases. (U.S. Patent 
5,566,772) For example, the TaKaRa long PGR kit employs two polymerases (Takara Shuzo Go., 
Ltd; Japan), and a number of polymerase combinations were also tested by Barnes (Proc. Nat. 



Acad. Sci. USA, 91:2216-2220 (1994). Tmncated Taq and T, flavus DNA polymerase enzymes that 
apparently exhibit increased themnostability and fidelity in PGR have also been suggested. (U.S. 
Patent 5,436,149.) Combinations of polymerases with and without 5' 3' exonuclease or 3' 5' 
proofreading activity have also been used. (U.S. Patent No. 5,489,523) 
5 Further, amplification-based assays and techniques have been improved through empirical 

testing of conditions, reagents, and reagent concentrations to optimize polymerization reactions with 
a particular enzyme. Temperature and length of amplification cycles, primer length, and pH, for 
example, are all conditions that can be optimized, (Bames, Proc. Nat. Acad. Sci. USA, 91:2216- 
2220 (1994).) 

10 However, accessory proteins can be even more useful in improving polymerase activity 

and/or the processivity of polymerases. "Processivity" in this context refers to the number of 
enzymatic reactions occuning each time an enzyme binds to its substrate. In the context of nucleic 

3 acid replication reactions, "processivity" means the number of bases that can be replicated when the 

S polymerase binds to a priming site. An increase in processivity directly relates to longer replication 

45 products. 

Z Intracellular replication has been shown to involve accessory proteins, as characterized in E. 

coli, human, and phage T4 systems. The accessory proteins interact with polymerases to improve 
" activity and provide the high processivity necessary to replicate genomic DNA efficiently while 
3 avoiding unacceptable mutation rates. Since the accessory proteins can be used in combination 
lo with the other improvements noted above, the development and application of accessory proteins 
P holds particular promise for enhancing the results of nucleic acid replication-based reactions. 

Accessory proteins have been identified in eukaryotes, E. coli, and bacteriophage-T4 and are 
thought to form "sliding clamp" structures, (Kelman and O'Donnell, Nucl. Acids. Res. 23(18): 3613- 
3620 (1995).) These structures are thought to tether the polymerase to DNA, thereby increasing 
25 processivity. The sliding clamp structures, however, have largely been studied in in vitro model 
systems. Only in the case of T4 polymerase has knowledge of the activity of such accessory 
proteins been used to improve polymerization-based techniques employed by researchers in the art. 
For example, accessory proteins of the T4 holoenzyme have been reported to improve processivity 
when added to polymerization systems using T4 polymerase. (Young et al., Biochem, 31(37): 8675- 
30 8690 (1992); Oncor Fidelity™ Sequencing System, Oncor; Gaithersburg, MD.) However, since the 
T4 accessory proteins are derived from bacteriophage, they are not likely to enhance polymerases 
from bacteria, archae, or eukaryotes. Thus, the use of T4 accessory proteins is believed to have 
been limited to techniques where T4 polymerase is used. 
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The presence of dUTP (deoxyuracil triphosphate) in a polymerization reaction and the effect 
of deoxyuridine-containing DNA on DNA synthesis have also been examined. In particular, 
deoxyuridine in a DNA strand has been shown to inhibit polymerization by archael DNA 
polymerases. (Lasken, ef a/., (1996) J. BioL Chem. 271; 17692-17696.) While Lasken et ai 
5 reported that archeal DNA polymerases, such as Vent, are inhibited by DNA containing 

deoxyuridine, they do not discuss the effect of removing uracil-containing nucleosides or nucleoside 
triphosphates from the reaction to prevent incorporation, Furthemnore, they do not discuss any 
enzyme that acts on or tums over dUTP in a reaction. Neither do they mention any dUTPase 
activity or the possible effect of dUTPase activity on polymerization reactions. In addition, Lasken et 
10 ai do not appreciate the fact that dUTP is generated during the course of a nomial PGR reaction by 
the deamination of dCTP. As a result of the deamination, dUTP will be present and be incorporated 
into an amplified nucleic acid, inhibiting the polymerase activity. Thus, the art has not appreciated 
r j the potential of dUTPase activities and proteins in enhancing replication reactions. 
|=Jf Accondingly, since present knowledge and use of accessory proteins has led to limited 

ujS applications in replication-based techniques, there continues to exist a need in the art for new and 
% more widely useful compositions for enhancing polymerase enzyme activity. The present invention 
M meets this need. 

Q Summary of the invention 

% The present invention comprises extracts, protein complexes, and related proteins that 

.pO possess nucleic acid polymerase enhancing activity useful in a variety of replication reactions known 
% in the art. Thus, the extracts, protein complexes, and related proteins of the invention function to 
enhance a wide spectaim of in vitro nucleic acid replication reactions by providing, inter alia, 
replication products of superior length, fidelity or both, and at higher yields. As used in this 
specification and appended claims "polymerase enhancing activity" means the ability to increase the 
25 rate, fidelity, and/or yield of a nucleic acid polymerization reaction mediated by a nucleic acid 

polymerase, or to expand or alter the range of conditions under which such reaction does or may 
proceed. 

In one aspect of the invention, extracts of Pyrococcus furiosus (Pfu) cells are provided that 
enhance the activity of Pfu DNA polymerase. The extracts enhance nucleic acid replication product 
30 yields over a fairiy broad range of concentrations and contain at least one polymerase enhancing 
factor. As used in this specification and in the appended claims, the temi "PEP" includes purified 
naturally occurring polymerase enhancing factors and wholly or partially synthetic copies or active 
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analogs thereof. In accordance with the invention, such extracts can be further purified by heparin 
affinity chromatography followed by sepharose gel purification. Additionally, PEFs can be identified 
and purified using the antibodies of this invention, discussed below. While Pfu cell samples were 
used and are specifically exemplified below, one skilled in the art will appreciate that other cell 
samples can be used to identify and purify PEFs. For example, other species of the archae 
Pyrococcus or Thermococcus can be used as well as thermophilic bacteria cells and other bacteria 
cells. In addition, eukaryotic cells and tissues can be used as a source for PEF, as demonstrated by 
the cloning and expression of human dUTPase, which also enhances polymerase activity. Thus, the 
invention also comprises compositions and methods wherein a dUTPase or any activity that turns- 
over dUTP is capable of acting to enhance a nucleic acid polymerization reaction. 

In another aspect of the invention, PEF complexes are provided. The PEF complexes of the 
invention possess polymerase enhancing activity and generally comprise multiple protein subunits 
with a combined molecular weight of approximately 250kD or above as determined by SDS-PAGE 
analysis and gel filtration of unheated PEF samples. An example of one PEF complex (P300) was 
purified from Pfu cell sample extracts. The predominant components of the complex are a 50kD 
protein (P50) and a 45kD protein (P45). Heat treating the Pfu P45 with 2% SDS and 1% TCA 
produces a 17-18kD protein, which represents the fully denatured form. However, the Pfu PEF 
complex contains other minor components with approximate apparent molecular weights of 150, 100, 
85, 60, 55, 42, and 37kD. At least two components (150 and 100) have been shown to be dimeric 
or polymeric forms of P50. Thus, the PEF complexes of the invention comprise protein components 
and function to enhance the activity of polymerases. 

In another aspect of the invention, Pfu proteins possessing polymerase enhancing activity are 
provided. These proteins have molecular weights between approximately 42 and 60kD by SDS 
PAGE analysis under partially denaturing conditions. The 42-60kD proteins may be used alone or in 
combination to enhance polymerase activity. Methods for purifying these proteins as well as the 
PEF extracts and PEF complexes from which they have been isolated are also provided. 

The invention also involves two particular proteins, Pfu P50 and P45, which are predominant 
components otthe PEF complex (P300). Detailed structural and functional information on the Pfu 
P45 and P50 proteins is disclosed. The P50 protein is similar in structure to a bacterial flavoprotein. 
The P45 protein is similar in structure to dCTP deaminase, functions as a dUTPase, and possesses 
polymerase enhancing activity. The structural information herein can be used to generate specific 
hybridization probes that detect the presence of nucleic acids encoding a protein that is part of a 
PEF complex, or related proteins from samples from other species, or possesses PEF activity. 



Furthermore, the structural information can be used to generate proteins from expression systems 
known in the art, synthetic proteins, partially synthetic proteins, or proteins made from a combination 
of natural proteins, expressed proteins, and synthetic proteins. Methods for detecting the presence 
or absence of polymerase enhancing activity and/or dUTPase activity are also included in this 
invention and can be used to identify the various active PEF proteins or analogs. In addition, 
polyclonal or monoclonal antibodies that bind to PEF components can be produced, for example 
from purified P45 or P50, purified PEF complexes (P300), or another PEF of the invention. These 
antibodies can then be employed in assays and kits, well known in the art, in order to identify the 
presence or absence of a PEF. 

The understanding of the catalytic activity of PEF, and the P45 protein in particular, provides 
aspects of this invention directed to polymerase enhancing proteins, as well as methods, kits, and 
compositions containing a dUTPase activity or dUTPase protein as a PEF. Thus, a dUTPase activity 
or dUTPase protein or composition can be used to enhance nucleic acid replication, polymerization, 
or PGR reactions according to this invention. In fact, any activity that functions to turn-over dUTP 
can be used as a polymerase enhancing activity of this invention. Wide-ranging sources for the 
dUTPase activity, protein, or composition exist, as it is demonstrated to be present from both archael 
and human sources, the ends of the phylogenetic possibilities. Thus, any cell or species can be 
used as a source for polymerase enhancing activity or PEF. 

Kits for replicating nucleic acids and methods for using the PEF complexes, specific proteins 
of the complexes, and extracts containing PEF are also provided. In addition, the complexes, 
proteins, and extracts can be used in compositions comprising a polymerase. Ideally, the 
polymerase will be one that is enhanced by the complex, protein, or PEF. The PEF extracts, 
complexes and proteins of the present invention are particulariy useful in mixtures with nucleic acid 
polymerases, such as native polymerases, those produced by recombinant DNA techniques, and kits 
containing such polymerases. 

Also provided in the invention are methods for identifying proteins or complexes that 
influence nucleic acid polymerases. The source of the protein can be any bacterial, archael, or 
eukaryotic species. Certain embodiments involve methods for identifying proteins affecting 
polymerases used in amplification reactions, for example, alpha-type DNA polymerases such as 
DNA polymerases derived from Pyrococcus and Thermophilis species. Other embodiments involve 
the analysis of dUTPase activity as well as computer implemented screening methods to identify a 
PEF. 



Description of the Drawings 
Figure 1. PGR enhancing activity in a heparin sepharose column fraction. PGR enhancing 
activity was measured using the 6.2 kb system described in example 1. Golumn fraction SCS #36 
H.S. #78 (prep. 2) was diluted in 1 x cloned Pfu PGR buffer and aliquots of the following were 
5 added to 100//1 PGRs. Figure 1: lane 1, buffer; lanes 2,4 column fraction diluted 1:10; lanes 3,5 
column fraction diluted 1:100. In Figure 1, lanes 4 and 5. AAA742 template DNA has been omitted 
from the reactions. 

Figure 2. PGR enhancing activity in a heparin sepharose column fraction. Figure 2: lane 1, 
buffer; lanes 2,5 column fraction undiluted; lanes 3,6 column fraction diluted 1:10; lanes 4,7 column 
10 fraction diluted 1:100. In Figure 2, lanes 5-7, cloned Pfu DNA polymerase has been omitted from 
the PGRs. 

Figure 3. PGR enhancing activity of SDS-PAGE gel-purified samples from heparin sepharose 
fraction SGS #36 H.S. #78 (prep. 2). Figure 3 shows the PGR enhancing activity of proteins eluted 
f from SDS-PAGE gel slices 1-7 (native Pfu DNA poL; lanes 1-7) and 9-20 (H.S. #78; lanes 9-20 from 
is Figure 4). The proteins were eluted as described in Example 2. One (1)//i of each gel slice, diluted 
Z 1:100 in cloned Pfu PGR buffer, was added to cloned Pfu PGRs as described in Example 1 (6.2kb 
■^^ primer-template system). In the left lanes of the gel is shown PGR product synthesis in the 
^ presence of 1//I of buffer (-) or H.S. #78, diluted 1:1000 (.001//I), 1:10,000 C0001//I), or 1:100,000 
3 (.00001//I). DNA mari<ers were electrophoresed in lane "m". 

^0 Figure 4. Figure 4 shows a duplicate of the master SDS-PAGE gel from which gel slices 

p were excised. The following proteins were electrophoresed on both gels: lane A, 8//1 of native Pfu 
i DNA polymerase (lot #24); lane B, pre-stained molecular weight mariners (Novex); lane G, heparin 

sepharose fraction SGS #36 H.S. #78 4//I («160 ng PEF). The samples were not pre-heated before 
loading, and the duplicate gel shown here was silver-stained. Gel slices 1-7 were recovered from 
25 lane A on the master gel, while slices 8-25 were recovered from lane G of the master gel. 

Figure 5. DNA polymerase activity in SDS-PAGE gel purified samples. The level of DNA 
polymerase activity (cpms incorporated) in gel slice eluates (Vl) was measured as described in 
Example 14. -The polymerase activity exhibited by gel-purified proteins numbered 1-7 in the native 
Pfu DNA polymerase preparation (Figure 4; lane A) is shown by the solid line (left Y axis). The 
30 polymerase activity of gel-purified proteins numbered 8-25 in fraction H.S. #78 (Figure 4, lane G) is 
shown with the broken line (right-handed Y axis). The apparent molecular weights of the proteins 
tested are shown on the x axis (at the top) and are inferred from the position the gel slices were 
recovered, relative to pre-stained molecular weight mariners. Gel slices #11 and 18 exhibited the 
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highest PGR enhancing activity. 

Figure 6. SDS-PAGE analysis of gel-purified P. furiosus PEF. The following samples were 

electrophoresed on 4-20% acrylamide gels in the presence of 1% SDS: lanes 2,3- SCS #37 H.S. 

fraction #75 (prep. 4; 10//! of a Microcon 30 concentrated fraction; «100 ng PEF) lanes 4,5 10//I of 
5 SDS-PAGE gel slice eluate recovered from SCS #37 H.S. fraction #75 (migrated 0-4mm above 

250kD protein marker). The samples in tanes 3 and 5 were boiled for 3 minutes before loading onto 

the SDS-PAGE gel. The completed gel was stained with silver stain. The migration of pre-stained 

molecular weight markers is shown in lane 1 . 

Figure 7. SDS-PAGE analysis of S200-purified P. furiosus PEF. 10//I (2.25 //g) of S200- 
10 purified PEF (prep. 1) was incubated in the presence of 2% SDS for 3 minutes at room temperature 

(RT), 75''C, SS^C, or lOO^C (boil), as indicated on the lanes, and then subject to electrophoresis as 

described in Example 2. Proteins were detected by silver-staining. Protein molecular weight 

mariners were run in lane "m". 
=1 Figure 8. PGR enhancing activity of SDS-PAGE gel-purified samples from an S200-purified 

rj15 P. furiosus PEF preparation (prep 1). Purified PEF (*4.5 //g) was pre-incubated in 2% SDS for 3 

minutes at room temperature (RT) or at 85°G (85''G) before electrophoresis. Proteins were eluted 
(2 from SDS-PAGE gel slices 1-13 (gel RT) and 14-26 (gel 85''C) as described in Example 2. One 
■=« (1)//l of each gel slice, diluted 1:100 in cloned Pfu PGR buffer, was added to cloned Pfu PGRs as 

O described in Example 1 (6.2kb primer-template system). The approximate molecular weights of the 

•:|20 SDS-PAGE gel-purified proteins is indicated at the top of the gel. DMA size mariners were run in 
J lanes "m". 

3 Figure 9. S200-purified PEF (1 .4 pg total protein) was heated at 85 or lOO'C prior to 

electrophoresis. Four slices were excised from the gel between the 60kD (slice 1) and 42kD (slice 
4) mari<ers. The proteins were eluted in 50//I of buffer and lOpI aliquots were boiled in the presence 

25 of SDS-BME loading dye and run out on 4-20% PAGE gels. Lane 1 : Novex pre-stained mariners; 
lane 2: 0.28 ng PEF; lanes 3-6: gel purified proteins isolated from S200-purified PEF heated at 
SS-'G- slice 1 (lane 3), slice 2 (lane 4), slice 3 (lane 5), and slice 4 (lane 6); lanes 7-10; gel purified 
proteins isolated from S200-purified PEF heated at 100°G: slice 1 (lane 7), slice 2 (lane 8), slice 3 
(lane 9), and slice 4 (lane 10). 

30 Figure 10. SDS-PAGE analysis of P. furiosus PEF preparations. In the left panel, the 

following samples were subject to SDS-PAGE silver stain analysis, conducted as described: cloned 
Pfu DNA polymerase lot#24A (56 ng///l) [lanes 1-3, 1//I, 2//I, 4//I]; S200 purified P. furiosus PEF 
fraction #46 (prep. 1 from SGS #38) [lanes 4- 2//I]; S200 purified P. furiosus PEF pool fractions #47- 
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48 (550 ng///l; prep. 3 from SCS #38)[lanes 5-8, Afj\, .2/j\, Afj\, In the right panel, the following 
samples were mn: cloned Pfu DNA polymerase lot#24A (56 ng///l) [lanes 1-3, 1//1, 2//I, 4/ill; microcon 
30-concentrated SCS #36 heparin sepharose fraction #78 (prep. 2) [lanes 4-6, 1//I, 2^/1, 0.4//I]; SCS 
#37 heparin sepharose fraction #75 (prep. 4) [lanes 7-8. 1//I, 5//I]. 
5 Figure 11. PDVF blots of P. furiosus PEP. PEF-containing heparin sepharose fractions 

(from SCS #37 Pfu purification) were concentrated and aliquots electrophoresed in 8 or 9 lanes on 
4-20% SDS-PAGE gels as described. The samples were boiled for 3 min. prior to loading to recover 
the 50kD monomeric PEF (top) or were loaded in the absence of heat treatment to recover the 
>250kD aggregate (bottom). The proteins were transferred to PDVF filters (BioRad) and stained 

10 with Amido black. 

Figure 12. PVDF blot of S200-purified P. furiosus PEF. Approximately 20//g of total protein 
was electrophoresed in each of 7 lanes on a 12% PAGE gel. The samples were heated at 85''C for 

-l 5 minutes prior to loading. The proteins v/ere transfen-ed to PVDF filters and stained as in the 

;i Figure 11 legend. 

Figure 13. Absorbance spectrum of S-200 purified P. furfosws PEF. The spectra of the 

:1; following were obtained on a Shimadzu UV160U spectrophotometer: 0.7 mg/ml P. furiosus PEF, 
9.6//M and A.SfM riboflavin, and PEF final storage buffer (blank). 

Figure 14. Recombinant P45 (rP45) amplification enhancement of 5.2 kb fragment. In the 

3 5.2 kb "On/OfT assay with PEF and rP45 samples, 7, 0.7 or 0.2 ng of the native PEF protein or 5//i, 

:|0 1 //I, or 1 A/I of a 1/10 dilution of the rP45 protein (20 ng/ul) was added to the on/off assay. 

=p Figure 15. SDS-PAGE analysis of fully denatured native PEF and rP45. The following 

;r| protein samples were electrophoresed on 4-20% SDS-PAGE gels as described in the text: 100 ng 

native PEF, boiled in 2%SDS without (lane 1) or with 1%TCA (lane 2); 200ng recombinant P45 clone 
1 , boiled in 2%SDS (lane 3) or loaded in the absence of heat-treatment (lane 5); 200ng recombinant 

25 P45 clone 23, boiled in 2%SDS (lane 4) or loaded in the absence of heat-treatment (lane 6). The 
migration of molecular weight markers is shown in lane 7 (far right). The gel was stained with 
SYPRO orange dye (Molecular Probes), diluted 1:5000 in 5% acetic acid. 

Figure -16. Reverse Phase HPLC Analysis of dUTP Conversion by PEF and rP45. Panel A, 
dUTP heated at 72<'C in 1x cloned Pfu polymerase buffer (negative control). Panel B, dUTP heated 

30 in 1x cloned Pfu polymerase buffer with 700 ng native PEF. Panel C, dUTP heated in 1x cloned Pfu 
polymerase buffer with 5 fA of rP45 (clone 1), Panel D, dUMP standard heated in 1x cloned Pfu 
buffer, Panel E, the products shown in panel B and panel D were mixed and injected simultaneously 
to demonstrate, through the production of a single peak, that dUMP migrates identically to the PEF 
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byproduct. 

Figure 17. Reverse Phase HPLC Analysis of dCTP Conversion by PEF and rP45. Panel A, 
dCTP heated in 1x cloned Pfu polymerase buffer without additive (negative control), Panel B, dCTP 
heated in 1x cloned Pfu polymerase buffer with 700 ng of PEF. Panel C, dCTP heated in 1x cloned 
5 Pfu polymerase buffer with 5 a/I of rP45. Panel D, dUTP and dCTP heated in 1x cloned Pfu 

polymerase buffer with 700 ng of PEF. Chromatograms in panels A, B and D, were performed in the 
same series, while chromatogram C was performed in a different experiment, using a different C-18 
column. In the second experiment a dCTP standard was shown to elute at 9.6 minutes (data not 
shown). 

10 Figure 18. Reverse phase HPLC analysis of the products generated by heat-treatment of 

dCTP. dCTP (10mM in cloned Pfu DNA polymerase PCR buffer) was heated at 95»C in thin-walled 
tubes in a RoboCycler 40 temperature gradient block. Incubations were earned out for the following 
=, length of time: 0 (panel A), 1 hour (panel B), or 4 hours (panel C). 10//I of the reaction mixtures 
i were loaded onto a Waters Delta-pak C-18 column (300 angstrom/1 5/mi), equilibrated in 50mM 

js triethylammonium acetate, pH(7.0) (A). Products were eluted with acetonitrile (B) at 2 ml/min., using 
J the following gradient: 0% B for 5 minutes, 0-10% B over 20 minutes. Absorbence of the eluate was 

monitored with a photodiode an-ay detector, and peak areas were integrated at 260nm. To the right 
^ of panels A and C are shown the same chromatograms re-integrated at a higher sensitivity. In panel 
1 D, 10/1/1 of 10mM dUTP (in cloned Pfu DMA polymerase PCR buffer) was chromatographed. 
IpO Figure 19. Comparison of the absorbence spectrum of dCTP, dUTP, and the product 

p generated from heating dCTP. The absorbence spectra (200-300nm) of the following were 
=j superimposed: (1) peak 9.283 from chromatogram A in Figure 18 (dCTP unheated); (2) peak 12.383 

from chromatogram C in Figure 18 (heated dCTP product); (3) peak 12.100 from chromatogram D in 
Figure 18 (dUTP unheated) The three spectra were nonnalized such that heights of the maximally 
25 absorbing peaks (i4max) are equivalent. 

Figure 20. Reverse phase HPLC analysis of the products generated by PCR cycling dCTP. 
dCTP (lOmM in cloned Pfu DNA polymerase PCR buffer) was cycled alone (Panel A) or in the 
presence of 0.5 ng//A PEF (Panel B) or 1 .25U///I Pfu DNA polymerase (Panel C). Cycling was 
earned out in a RoboCycler 40 using the following conditions: 95°C 1 min./30 cycles of: dS'C 1 
30 min.,60''C 1 min.,72»C 10:24 min./ 72''C 10 min. 10//I of the reaction mixtures were 

chromatographed as described in the Figure 18 legend. In panel D, the absorbence spectra of 
dCTP, dUTP, and the product generated from PCR cycling dCTP are compared. The absorbence 
spectra (200-300nm) of the following were superimposed: (1) major peak at 8.3-8.8 min. from 
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chromatogram of dCTP + Pfu unheated; (2) peak at 1 1.867 min. from chromatogram of dUTP in 
buffer unheated; (3) peak 12.150 from chromatogram C (Figure 20) showing dCTP + Pfu cycled. 

Figure 21. dUTP Inhibition and R«5versible Inhibition with PEF and rP45. The following 
components were added to cloned Pfu DNA Polymerase PGR reactions (5.2kb "on/off" assay of 
example 1): Lane 1 , 2 //M dUTP + 7 ng native PEF. Lane 2, 2 fM dUTP. Lane 3, 2 pM PEF 
generated dUMP. Lane 4, 20 //M PEF generated dUMP. Lane 5, 2 //M dUTP + 180 ng rP45 clone 
1 . Lane 6, 2 //M dUTP + 20 ng rP45 clone 1 . Lane 7, 2//M dUTP + 2 ng rP45 clone 1 . Lane 8, 2 
fjM dUTP + 180 ng rP45 clone 23. Lane 9, 2 //M dUTP + 20 ng rP45 clone 23. Lane 10, 0.2/M 
dUTP. Lane 11, 0.2^^^! dUTP + 7ng native PEF. Lane 12, 0.2/M dUTP + 20 ng rP45 clone 1. 
Lane 13, 0.2/jM dUTP + 20 ng rP45 clone 23. In the right Lane M, 1 Kb DNA mari<ers were run. 

Figure 22. Inhibition of PGR with unrelated dU Gontaining Oligonucleotides. A 0.9kb target 
was amplified from human genomic DNA. Lanes 2 and 3 represent the PGR amplification without 
the addition of a third oligo. The sample in lane 3 was amplified in the presence of 7ng of PEF. 
Lanes 4-8 and 10-11 represent the titration of the control (dT rather than dU) oligonucleotide 
(40mer). Lanes 13 through 32 represent samples amplified in the presence of the dU 
oligonucleotide (40mer). Lanes 13-19 were amplified with Pfu DNA polymerase alone. Lanes 14 
through 26 were amplified with Pfu DNA polymerase in the presence of 7 ng of PEF. Lanes 27 
through 32 were amplified with Taq DNA polymerase. 

Figure 23. Human dUTPase mimics PEF/P45 activity. PEF activity was measured using the 
5.2kb primer-template assay described in example 1. At each annealing step of the PGR reaction, 
O.Sa/I of the following were added: dUTPase storage buffer (negative control), recombinant P45 (at 
2ng///l)(positive control), human dUTPase preparation undiluted (duplicate reactions), or diluted 1:10 
(duplicate reactions). 

Figure 24. Presence of PEF(P300) in P. furiosus DNA polymerase preparations. SDS- 
PAGE analysis is shown for six preparations of native Pfu DNA polymerase with varying levels of 
PEF present. The volumes of each lot loaded is indicated (/xl). The protein samples were not boiled 
prior to electrophoresis and the gels were silver-stained.. 

Figure ZS. Western blot analysis of crude extracts using P. furiosus PEF-specific IgG. 
Extracts were prepared from 5 different fermentations of P. furiosus (lanes 1-5), 3 partially purified 
fractions from T. aquaticus (lanes 6-8), and 1 extract from E. coli (lane 9). Purified PEF (550 ng) 
and pre-stained molecular weight mari<ers were mn in lanes 10 and 11, respectively. With the 
exception of the mariners, all samples were boiled in SDS/BME dye prior to loading. 

Figure 26. Western blot analysis of native PEF samples using anti-recombinant P45 

-12- 



antibody. Three samples (700 ng each) of native Pfu PEF were prepared as follows: unheated/2% 
SDS loading dye (lane 1); heated/2% SDS loading dye (lane 2); and heated/1 %TCA/2% SDS loading 
dye (lane 3). Heating was earned out for 5 minutes at 95°C. The samples were eiectrophoresed on 
a 4-20% gradient gel and transfen-ed to nitrocellulose. The blot was probed as described in the text 
with rabbit anti-rP45 sera. Novex prestained molecular markers (SeeBlue™) were mn adjacent to 
the PEF samples in lane 4. 

Figure 27. Western blot using the antibody to CBP-rP45 and cell lysates from different 
species. The lysates shown in lanes 2-7 were prepared by sonication. The samples were heated at 
95°C prior to loading except for the sample loaded in lane 10 which was left at room temperature. 
Lane 1, SeeBlue™ Markers (Novex), Lane 2, Uncharacterized Bacillus species (Magenta), lane 3, 
Thermus aquaticus, lane 4, uncharacterizied species, Lane 5, Pyrococcus furiosus, Lane 6, 
Methanobacterium thermoautotrophicum, lane 7, Human lymphoma HL60 cells, Lane 8 purified 
Pyrococcus furious PEF, Lane 9 purified fusion CBP-rP45. Lane 10 purified fusion CBP-rP45. 

Figures 28, 29, and 30. PGR enhancing activity of P. furiosus PEF in cloned Pfu DNA 
polymerase PCRs. PGR amplifications were perfomried as described in example 14 with the 
following additional notes. Figure 28. A 1.9 kb lac\-lacZa target was amplified from plasmid DNA. 
100A/I PGRs were conducted with 50 pg of pPRIAZ, 100 ng of primers (5 'GAT AGG GAA TTG GGA 
AAA GGT TTG GCG GTA TGG 3' (SEQ. ID NO: 20); 5*AGT AGG GAA TTG GAG GGA AAA TGG 
GGG TCA TGG 3' (SEQ ID NO: 21)), and 5U cloned Pfu DNA polymerase in the absence (duplicate 
samples #18) or the presence (duplicate lanes #19) of 0.5/il of a PEF-containing heparin sepharose 
fraction (H.S. #75; prep. 4; « 10 ng///l PEF). PGR cycling was conducted on a GeneAmp PGR 
System 9600 (Perkin Elmer Getus) using the following conditions: 30s at 95"*G (1 cycle)/ 5s at 95»G; 
1 min. at 55°G; 2.5 min. at 72''G (30 cycles). Figure 29. A lOkb target from lambda DNA was 
amplified. 100//I PGRs were conducted with 250 ng of lambda DNA (Sigma), 250 ng of primers 
(F51-20 5'GGG-GTT-TGG-GTT-GTT-GTT-GG 3' (SEQ ID NO: 22) R10163-20 5'GGA-TGT-GAG- 
GGG-GGA-GTT-TG 3' (SEQ ID NO: 23)), and 5U cloned Pfu DNA polymerase in the absence (lane 
1) or the presence of 1//i of a S200-purified PEF (prep. 3; 550 ng///l PEF) diluted 1:500 (lane 2), 
1 :50 (lane 3), or 1 :5 (lane 4). PGR cycling was conducted on a Robocycler 40 (Stratagene) using 
the following conditions: 95<»G for 1 min. (1 cycle)/ 95''G for 1 min.; 62*'G for 1 min.; 72''G for 10 min. 
(30 cycles). Figure 30. Lanes 1-5, a 5.2 kb portion of the human a1 antitrypsin gene was amplified 
from genomic DNA. 25//I PGRs were conducted with 62.5 ng of human genomic DNA (Promega), 50 
ng of primers (F91-23 5'GAG GAG AGG AGG AAA GGT GGA AG (SEQ ID NO: 24); R5271-21 
5'GGT GGG AGA AGA GTT GAG TGG) (SEQ ID NO: 25), and 0.8U cloned Pfu DNA polymerase in 
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the absence (lane 1) or the presence of Ijtfl of S200 purified PEP (SCS #52; 0.7 ug/ul) diluted 
1:1000 (lane 2), 1:10.000 (lane 3), 1:100,000 (lane 4) or 1:1.000,000 (lane 5). PGR cycling was 
conducted on a RoboCycler 96 (Stratagene) using the following conditions: 96'*C for 45s (1 cycle)/ 
96X for 45s; eO^'C for 45s; 72*'C for 14 min. (35 cycles)/72X for 10 min. (1 cycle). 

Figure 31. Effect of Stratagene's Perfect Match (£ coli ssb) on the specificity of PCRs 
conducted with cloned Pfu DNA polymerase and P, furiosus PEP. PCRs were conducted using the 
6.2kb test system (example 1) in 100//I reaction volumes. 1//I of the following were added to PCRs: 
lanes 1.5-dilution buffer; lanes 2,6- undiluted PEF-containing heparin sepharose fraction (microcon 
30-concentrated SCS #36 H.S. #78. prep. 2; «40 ng///l PEP); lanes 3,7- PEP fraction diluted 1:10; 
lanes 4,8- PEP fraction diluted 1:100. 1//I of Perfect Match was added to PCRs run in lanes 5-8. No 
DNA samples were loaded in the lanes between lanes 3 and 4, 4 and 5, and 7 and 8. 

Figure 32. SDS-PAGE analysis of heparin sepharose (H.S.) column fractions. The heparin 
sepharose fractions indicated (# at top) from SCS native Pfu DNA polymerase prep. #37 (SCS #37) 
were analyzed on 4-20% SDS-PAGE gels (4/til/lane). Prestained molecular weight markers were run 
in lanes denoted "m" and 4//I of SCS #36 H.S. #78 (PEP prep. 2) was mn in lanes marked "A". The 
peak of DNA polymerase (95kD) activity eiluted between fractions 50 and 61. 

Figures 33, 34, 35, and 36, Enhancing activity of P. furiosus PEP in PCRs conducted with 
Pwo (Figure 33), JDF-3 (Figures 34 and 35), ES4 and Vent (Figure 35), and Deep Vent (Figure 36) 
DNA polymerases. In Figure 33, 25fA PCRs were conducted using the 6.2kb test system (example 
1) with 25U/ml Pwo DNA polymerase and 1 x cloned Pfu PCR buffer. I^/I of S200-purified P. 
furiosus PEP (prep 1; 225 ng//il) was added undiluted (lane 1) or diluted 1:10 (lane 2), 1:1000 (lane 
3), 1:10,000 (lane 4). 1//I of dilution buffer was added as a negative control (lane 5). In Figure 34, 
a 10 kb lambda DNA target was amplified from lambda DNA (lanes 1-4; 5-7; 11-13) or mouse 
genomic DNA, containing 40 copies (lanes 8-10) or 1 copy (lanes 14-16) of a lambda DNA 
transgene. 100//I PCRs were conducted with 250 ng of lambda DNA (Sigma) or mouse genomic 
DNA, and 250 ng of primers (F51-20 5'GGC-GTT-TCC-GTT-CTT-CTT-CG (SEQ ID NO: 22); 
R1 01 63-20 5'CCA-TCT-CAC-GCG-CCA-GTT-TC) (SEQ ID NO: 23). PCRs were conducted in Taq 
PCR buffer using 5U Taq DNA polymerase (lanes 1-4) or 1U JDF3 DNA polymerase (lanes 5-16). 
1//I of the following was added to PCRs: S200-purified PEF (prep. 3; 550 ng///l PEF) diluted 1:500 
(lanes 2, 12, 15), 1:100 (lanes 6,9), 1:50 (lane 3), 1:10 (lanes 7.10,13,16), or 1:5 (lane 4). 1a/I of 
dilution buffer was added as a negative control (lanes 1,5,8,11,14). PCR cycling was conducted on 
a Robocycler40 (Stratagene) using the following conditions: 95^*0 for 1 min. (1 cycle)/ 95*'C for 1 
min.; 62°C for 1 min.; 72^*0 for 10 min. for Taq or 5 min. for JDF3 (30 cycles). In Figure 35, the 
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1.9kb ligase gene was amplified from P. furiosus genomic DNA. 100//I PCRs were conducted with 
250 ng of DNA and 250 ng of primers (5'(3AG CTT GCT CAA CTT TATC (SEQ ID NO: 26); 5'GAT 
AGA GAT AGT TTC TGG AGA CG) (SEQ ID NO: 27). PCRs were conducted with 10U ES4 DNA 
polymerase in Pfu PGR buffer (lanes 1,2), 1.5U JDF3 DNA polymerase in Taq PGR buffer (lanes 
3,4), 4U Pfu DNA polymerase in cloned Pfu PGR buffer (lanes 5,6), 1U Vent DNA polymerase in 
Vent PGR buffer (lanes 7,8), or 1U Taq DNA polymerase in Taq PGR buffer (lanes 9,10). of the 
following was added to PGRs: dilution buffer (lanes 1,3,5,7,9) or S200-purified PEF (prep. 3; 550 
ng///l PEF) diluted 1:100 (lanes 2,4,6,8,10). PGR cycling was conducted on a DNA Themnal Gycler 
480 (Perkin Elmer Getus) using the following conditions: 95''G for 1 min.; 46»G for 1 min.; 72°G for 2 
min. (30 cycles). In Figure 36, a 2kb DNA target was amplified from transgenic mouse genomic 
DNA. 100//I PGRs were conducted with 250 ng of DNA and 250 ng of primers (F51-20 5'GGG GTT 
TGG GTT GTT GTT GG (SEQ ID NO: 22); R2092-23 5'GGG GAT ATG GAG ATT TGT GGA GG) 
(SEQ ID NO: 28). PGRs were conducted with 0.75U Deep Vent DNA polymerase in Deep Vent 
PGR buffer (lanes 1^). ^^^\ of the following was added: dilution buffer (lane 1) or S200-purified PEF 
(prep. 3; 550 ng///l PEF), diluted 1:500 (lane 2), 1:100 (lane 3), 1:50 (lane 4). PGR cycling was 
conducted on a Robocycler40 (Stratagene) using the following conditions: 95''G for 1 min.(1 
cycle)/95''G for 1 min.; 62°G for 1 min.; 72''G for 2 min. (30 cycles). 

Figure 37. Enhancement of RT-PCR with P. furiosus PEF. A portion of the EGF receptor 
sequence was amplified from HepGj total RNA using the 2-step RT-PGR protocol described with the 
following additions. 1/ii of a cDNA synthesis reaction was added to 50//I PGR reactions containing 
5U Pfu DNA polymerase, 1//M antisense primer (5' GAG-TTA-AAT-GGG-TAG-AGT-GTA-TGT) (SEQ 
ID NO: 29), 1//M sense primer [5' GAG-GAG-TGA-GAA-GGT-GGT-ATG-GAA (SEQ ID NO: 30) (1kb 
product) or 5'GTG-GAG-GTG-GGG-TGT-AGG-ATT-TGT (SEQ ID NO: 31) (3kb product)]. 1//I of 
buffer (0) or 1^ of a PEF-containing heparin sepharose fraction (SGS #37 H.S. #75; prep. 4; «10 
ng///l PEF) was added undiluted (UD) or diluted 1:10 to 1:10® (as indicated). PGRs were conducted 
in cloned Pfu PGR buffer, containing 3mM MgS04. 

Figure 38. Enhancement of Pfu DNA polymerase-based Seamless Gloning with P. furiosus 
PEF. 20 ng-of plasmid was amplified as described in the Seamless Gloning kit protocol in the 
absence (lanes 2,4,6,8,10,12) or presence (duplicate lanes 1,3,5,7,9,11) of S200-purified P. furiosus 
PEF (prep. 3; 550 ng///l), diluted 1:100. A 7.2 kb target was amplified with 6 different primer pairs 
(each set of 3 lanes). An extension time of 7.5 minutes was used for PEF-containing reactions, 
while an extension time of 15 minutes was used for reactions lacking PEF. 

Figure 39. Enhancement of Pfu DNA polymerase-based QuikGhange mutagenesis with P. 
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funosus PEF. QuikChange mutagenesis was performed using the kit control primers and plasmid 
template, with either native or cloned Pfu DNA polymerase. To the reactions was added 1pl of 
dilution buffer or l^/l of a PEF-containing heparin sepharose fraction (SCS #36 H.S. #78; prep. 2 « 
40 ng///l), diluted as indicated. The relative amount of linear amplification product was assessed by 
5 the intensity of ethidium bromide-stained product bands on agarose gels, Supercompetent E. coli 
cells were transfomned with the digested amplification products. The number of amp resistant 
colonies and the mutation frequencies were scored. 

Figures 40 and 41. PGR enhancing activity of S200"purified P. funosus PEF. PGR 
enhancing activity was measured in duplicate assays using the 6.2 kb test system described in 
10 Example 1. In Figure 40. purified PEF (225 ng///l; prep. 1) was diluted in 1 x cloned Pfu PGR buffer 
and ^JJ\ aliquots of the following were added to 24//I PGRs- lane 1, buffer; lane 2. PEF undiluted; 
lanes 3-6, PEF diluted 1:10, 1:100, 1:1000, 1: 10,000, respectively. In Figure 41, the following 
amounts of purified PEF were added: lane 1- 1//I of buffer, lane 2- 1//I PEF neat, lanes 3-8, 1//I PEF 
q diluted 1:10, 1:100, 1:1000, 1:10,000, 1:100,000, and 1:1,000,000. 

[IPS Figures 42 and 43. PGR enhancing activity of heparin sepharose-purified P furiosus PEF. 

\^ PGR enhancing activity was measured using the 6.2 kb test system described. The PEF fraction 

(«40 ng//t/l; prep. 2 in text) was diluted in 1 x cloned Pfu PGR buffer. In Figure 42, 1//1 aliquots of 
==y the following were added to 50|/l PGRs- lane 1, buffer; lanes 2-5, PEF diluted 1:10, 1:100, 1:1000, 1: 
P 10,000, respectively. In Figure 43, 1//I aliquots of the following were added to 24/il PGRs- lane 1, 

5|0 buffer; lane 2, PEF undiluted; lanes 3-6, PEF diluted 1:10, 1:100, 1:1000, 1: 10,000, respectively. 

DNA mari<ers were electrophoresed in lane "m". 
^ Figure 44. PGR enhancing activity of heparin sepharose column fractions. The PGR 

enhancing activity contained in 1//I of each column fraction (fractions 1-93 from Figure 32; numbered 

at top) was measured using the assay 6.2kb primer-template PGR described in Example 1. 
25 Fractions 50-61 contained the peak of Pfu DNA polymerase activity and were excluded from 

analysis. 

Detailed Description of Specific Embodiments of the Invention 

The following description should not be construed to limit the scope of this invention to any 
specifically described embodiment. Various aspects and embodiments of this invention will be 
30 apparent from the disclosure as a whole in context with the knowledge of one skilled in the art. In 
addition, the description herein, in combination with infomiation known or available to persons of 
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ordinary skill in the art, enables the practice of the subject matter encompassed by the following 
claims. 

For the purposes of this invention,, a nucleic acid replication reaction can mean any of the 
numerous nucleic acid amplification, primer extension, reverse transcription, or other nucleic acid 
5 polymerization reactions known in the art. Additionally, a replication reaction of this invention 
includes any reaction in which the ability of an enzyme to interact with a first nucleic acid and 
generate a second, substantially complementary nucleic acid sequence, is involved. The 
amplification reactions of this invention are not limited to PCR processes or any particular PCR- 
based assay, although they are particulariy useful herein, and specifically include RT-PCR 
10 processes. The proteins, preparations, compositions, mixtures, kits and methods of this invention 
can be used with any appropriately designed nucleic acid replication reaction. 

As used herein, the iem "PEF" refers to a naturally occuning protein derived from a 
bacterial, eukaryotic, or archael source (or a wholly or partially synthetic copy or analog thereof) 
,g having polymerase enhancing activity, mixtures of one or more such proteins, protein complexes 

I^JJlS containing one or more such proteins, or extracts containing one or more of such proteins, mixtures 
M or complexes. Recombinant PEF proteins, as a wholly synthetic copy of a naturally occuning 
lV protein, and complexes with at least one recombinant PEF protein are also "PEFs" according to this 
Ul invention. The Pfu P45 and P50 proteins of this invention are illustrative of PEF proteins, which 

exhibit an apparent molecular weight of approximately 45kD and 50kD and are predominant 
SiZO components of a PEF complex derivable from Pfu. Data relating to both the P45 and P50 proteins 
I is presented herein and details specific stnjctural infomnation. On SDS-PAGE, the non-heat-treated 

Q PEF complex containing P45 and P50 as well as minor additional components migrates with an 

apparent molecular weight > 250kD. One species of the PEF complexes of this invention is the 
P300 complex from Pyrococcus furiosus, 
25 The present invention is intended, however, to encompass other PEF proteins, mixtures, 

complexes, compositions, and extracts derived from organisms other than Pfu identified by 
techniques analogous to those provided by the following examples, or by use of the stnjctural 
information on the PEF proteins described herein or derivable from the proteins described herein. 
More specifically, the invention is intended to encompass PEFs identified on the basis of sequence 
30 homology to all or part of the PEFs described herein, including nucleic acid homology to ail or part 
of the DNA sequence encoding the proteins described herein or the DNA sequences described 
herein. Computer-implemented homology searches using the sequence information herein, stored 
on an appropriate computer-readable medium, with procedures known in the art, can generate these 
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homologous PEFs. Also, reactivity with antibodies to the proteins, complexes, or extracts disclosed 
herein can be used with procedures known in the art to generate homologous PEFs. 

One skilled in the art is familiar with methods of generating analogs of proteins. Various 
techniques from publications in the art can be used to mutate, modify, tmncate, or othenwise change 
5 a protein's amino acid sequence and retain functional activity. In the case of a dUTPase activity as 
PEF, the known crystal structure of dUTPases, such as £. coli dUTPase, provides specific 
information on regions of a dUTPase that may be mutated in any of a number of ways while still 
retaining PEF activity (see Cedergren-Zeppezauer, E. S., ef a/., Nature 355: 740(1992), which also 
notes similarities to mammalian dUTPases in Hokari, S., et a/.. Arch. Biochem. Biophys. 253: 350 
10 (1987)). Similarly, wholly or partially synthetic or recombinantly expressed proteins can also be 
generated from the infomfiation herein by those skilled in the art. (For example, Ausubel et al. 
(1989) Cun-ent Protocols in Molecular Biology, and supplements through Febmary 1997.) 
3 Furthermore, a PEF can also be a protein exhibiting a dUTPase activity, as demonstrated 

0 herein. Specifically, human, Pyrococcus furiosus, and Thermos thermophilis dUTPase activities can 
J 5 be used to enhance polymerization reactions as a PEF. Other eukaryotic, bacterial, and archael 
J sources can similarty be used to derive a PEF as a dUTPase protein. A PEF can also be any 

protein that functions to turn-over dUTP or lower the concentration of dUTP. Similariy, the Invention 
comprises a method of enhancing replication or amplification reactions by reducing the dUTP 
3 concentration or preventing the incorporation of dUTP into replicated or amplified products, as well 
fo as compositions that are capable of preventing that incorporation. 

p The polymerase enhancing activity of the PEFs of this invention can be detennined in a 

J number of different ways. The description below details a few examples of assays and techniques 
one skilled in the art can use to detemnine if PEF activity is present. These assays and techniques 
can be used alone or in combination. 

25 Example 1 specifically details screening assays and the "on/off* assay. This type of PGR 

assay allows one to identify the presence of a polymerase enhancing activity in a sample. More 
generally, any assay that shows an increase in PGR product yield, over a negative control level, 
when a sample suspected to contain a polymerase enhancing activity is added can be used to 
identify a polymerase enhancing activity. Also, any assay that shows an increase in processivity, 

30 over a control level, reflected by the increased length of PGR products being generated when a 
sample suspected to contain a polymerase enhancing activity is used. A combination of PGR 
product yield and increased processivity can also be used to detennine whether or not a polymerase 
enhancing activity is present. 
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A polymerase enhancing activity can also be identified by assays that indicate a reduction in 
the PCR inhibitory action of incorporated dUTP. For example, PGR reactions can be conducted in 
the presence of dUTP and samples suspected of containing polymerase enhancing activity. Those 
reactions that allow polymerization in the presence of dUTP indicate a polymerase enhancing activity 
5 in the forni of a dUTPase activity. Thus, a dUTPase activity can be a polymerase enhancing activity. 

Also, a composition that functions to turn-over dUTP, especially under thennophilic reaction 
or PCR reaction conditions, can be a polymerase enhancing activity as a dUTPase. An enzyme or 
activity that acts on dUTP so that it is not Incorporated into a newly polymerized strand functions to 
turn-over dUTP. The turn-over of dUTP can also be detected by an assay for the conversion of 
10 dUTP into dUMP, as detected by analyzing the reaction products by HPLC, for example. 

Biochemical assays that detect the conversion of dUTP into dUMP, or other nucleoside phosphate or 
metabolic derivatives or products, can be devised or are known in the art and can be used to identify 
□ polymerase enhancing activity as a dUTPase activity. 

;|:jf A polymerase enhancing activity can also be a dUTPase enzyme that possesses the 

Ld5 consensus uridine-binding sequence motif (SEQ ID NO.: 72). A number of those enzymes are 

]Z identified below. However, numerous others exist or can be identified through computer- 

M= implemented or other sequence analysis pnDcedures known in the art. Thus, the presence of the 

'y" consensus uridine-binding motif or the related sequences shown herein can also be used to define 

Q an enzyme or protein that is a PEF, such as a protein that comprises SEQ ID NO.: 72, or any one of 

1=^0 SEQ ID NOs.: 72-81, or combinations of these sequences. 

Also, proteins identified through sequence identity comparisons known in the art can be used 
to confimn the presence of a PEF. For example, proteins from one species possessing a sequence 
identity of approximately 18% or greater have been shown in the art to be related to or analogous to 
the known protein of another species. In the examples below, a sequence similarity of 
25 approximately 39% suffices to positively identify a dUTPase activity that can act as a PEF. 

The antibodies to PEF described herein can also identify a protein with polymerase 
enhancing activity. For example, Western blot analysis of compositions from various archeal, 
bacterial, thermophilic bacterial, or eukar/otic samples can identify a protein that possess 
polymerase enhancing activity. Furthemnore, as the PEF proteins and complexes of this invention 
30 are demonstrated as immunogenic, various other antibodies to PEF may be produced by techniques 
known in the art with the infomnation herein. These other antibodies can also be used to identify a 
PEF. 
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Example 1 
Screening for PEF Activity 

Protein-containing extracts from a number of different sources can be tested for PEF activity. 
The extracts can be prepared in a number of ways known in the art. 
5 One method was demonstrated with Pfu DSM 3638 cells. The cells were grown, a cell paste 

collected by centrifugation and then frozen at -80*'C. The paste was taken up with lysis buffer [50 
mM Tris-HCI (pH 8.2), 1 mM EDTA, 10 mM B-mercaptoethanol, 0.5mM PMSF, and 2fjg/m\ 
aprotinin], and thereafter the cells were lysed in a French press and then sonicated. Following 
sonication, the lysate was centrifuged and the supematant, containing potential PEFs, was collected 
10 for assays. 

Extracts from any cell producing a PEF, for example, cells transfected with a recombinant 
vector directing the expression of a PEF, can also be assayed. Methods of making extracts of these 
cells are known in the art and are exemplified below. 

y 1. Screening Assays for PCR-Enhancing Activity 

^5 One method of detecting thermostable PEFs is by screening partially-purified fractions from 

thermophilic archeal or bacterial extracts for PGR enhancing activity, PGR enhancing activity can be 
- detected in samples consisting of column-purified fractions as well as homogeneous protein samples 
3 and proteins recovered by elution from SDS-PAGE gel slices (see below). Samples are added to 

^ PGR amplification reactions containing DNA polymerase, buffer, dNTPs, primers, and DNA template. 
;f 0 PGR enhancing activity is identified by an increase in PGR product yield for amplifications conducted 
^ in the presence of a particular sample (DNA polymerase + PEF) as compared to amplifications 

conducted in the absence of added sample (DNA polymerase only). 

When screening samples suspected of containing endogenous DNA polymerase activity, for 

example protein extracts, negative controls can be performed in which the exogenous DNA 
25 polymerase has been omitted from the PGR amplifications. In addition, when screening samples 

contaminated with DNA, negative controls can be canned out in which exogenous DNA template is 

omitted from the PGR amplifications. 

The sensitivity of the PGR enhancing assay is dependent on the complexity of the DNA 

targets employed. PGR reaction parameters (target complexity, DNA template concentration, 
30 polymerase concentration, PGR cycle number or extension time) can be adjusted so that the yield of 

PGR product is barely detectable under normal conditions. In addition, samples for testing can be 

diluted appropriately so that the concentration of PEFs falls within the detectable range of the PGR 
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A second embodiment of an assay to screen for PEF employs, for example, the 5.2 kb 
human crl -antitrypsin gene in a PGR amplification. PGR amplification of this primer/template system 
was so limited that, in the absence of PEF, it was difficult to detect any PGR product. With added 
PEF activity, a 5,2 kb product was easily detected. The following conditions were used for this 
5 "On/OfT' assay: In 100 a/I - 1x Cloned Pfu DNA polymerase buffer, 200 a/M each dNTP, 200 ng 

primer F-91-23, 200 ng primer R5271.21, 125 ng Human Genomic DNA, 2.5 units cloned Pfu DNA 
polymerase, +/- PEF or recombinant P45 (rP45). 

Primer F91-23 5' GAGGAGAGCAGGAAAGGTGGAAC 3' (SEQ ID NO: 64) 
Primer 5271-21 5' gctgggagaagacttcactgg 3' (SEQ ID NO: 65) 

10 The PGR cycling conditions were as follows: 95**G for 1 minute (1 cycle), 95*»G for 1 minute - 60°G 
for 1 minute - 72°G for 10 minutes (30 cycles). After completion, the reactions are run out on an 
electrophoresis gel and the quantity of reaction products determined by any of a number of methods 
known in the art. 

3. Screening Assays for Nucleic Acid Replication Enhancing Activity 

i^^15 Extracts can also be added to any nucleic acid replication reaction to detemiine PEF activity, 

y; Many of these reactions are known in the art, including primer extension reactions, DNA sequencing 

ul reactions, site-directed mutagenesis readions, and a number of PCR-based reactions. (Ausubel, 

Q F.M., et al. (1989) Gun^nt Protocols in Molecular Biology, Greene Publishing Associates and Wiley- 

1 Interscience, New York, NY; Sambrook, J., et al. (1989) Molecular Gloning: A Laboratory Manual, 

^20 2nd Ed., Gold Spring HariDor Laboratory Press, Gold Spring Harbor, N.Y.) By comparing the results 
=3 produced in a nucleic acid replication reaction with and without the added extract, one can identify 

the presence of PEF. 

Example 2 
Purification of PEF from R furiosus 

25 Once PGR enhancing activity has been detected from, for example, archeal or bacterial 

sources, large amounts of purified PEF can be obtained by column chromatography. The following 
protocol was developed for purifying PEF from P. furiosus (Pfu), However, one skilled in the art will 
appreciate that other cells or species could be used as well. 
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1. Cell Growth and Lysis 

P. furiosus DSM 3638 cells were grown in a 400 liter fermentor according to established 
protocol. (U.S. Patent No. 5,545,552, specifically incorporated herein by reference.) The cell paste 
was collected using a Sharpies in-line centrifuge after approximately 20 hours (A600 «0.5), and 
5 then immediately frozen in liquid and stored at -80**C until use. Then, 500 grams of frozen cell 
paste was transferred to a 4 liter stainless steel beaker on ice. The cells were resuspended with 2 
liters of lysis buffer, consisting of 50 mM Tris-HCI (pH 8.2), 1 mM EDTA, 10 mM B-mercaptoethanol, 
0.5mM PMSF, and 2/ig/ml aprotinin. The cells were lysed in the French press using 2 passes at 8K 
RSI and the lysate was then sonicated for 10 minutes. Following sonication, the lysate was 
10 transferred to 400 ml bottles, spun for 1 hour at 9K rpm in a Sorvall RC-2B centrifuge using a 
Sorvall GS3 rotor, and the supematant collected. 

2. Purification by Column Chromatography 

J The supematant was loaded at a flow rate of 5ml/min. onto a 10 x 5 cm Q-Sepharose Fast 

Flow™ (Pharmacia) column («392 mis), pre-equilibrated in buffer consisting of 50 mM Tris-HCI (pH 

hJ5 8.2), 1 mM EDTA, and 10 mM B-mercaptoethanol. The column was washed with 2 column volumes 
of buffer, and the pass-through and column washes were collected and pooled. The pooled 

Ul fractions were adjusted to pH 7.5 using IN HCI. 

rj The Q-Sepharose pass-through was then loaded at a flow rate of 5 ml/min. onto a 5 x 1 1.5 

W cm («225 mis) SP Sepharose Big Bead™ (Pharmacia) column, equilibrated in buffer containing 50 
|0 mM Tris-HCI (pH 7.5), 1 mM EDTA, 1 mM DTT, 10% (v/v) glycerol, 0,1% (v/v) Igepal CA^30, and 
□ 0.1% (v/v) Tween 20. The column was washed with equilibration buffer until the absoriDance (ODjgo) 
approached baseline. The column was eluted with a 2 liter gradient from 0 to 250 mM KCI (in 
equilibration buffer). Fractions of 20 ml were collected, and aliquots removed from every third tube 
for SDS-PAGE analysis. 

25 Some of the fractions analyzed by SDS-PAGE showed a band > 250kD when a sample was 

not heated prior to electrophoresis («300kD). The fractions containing the 300 kD band were 
pooled and dialyzed overnight against 2x4 liters of Buffer A [50 mM Tris-HCI (pH 8.2), 1 mM 
EDTA, 1 mM DTT, 10% (v/v) glycerol, 0.1% (v/v) Igepal CA-630, and 0.1% (v/v) Tween 20]. The 
dialyzed pool was loaded at a flow rate of 2 ml/min. onto a 2.6 x 29 cm (== 154 mis) Heparin 

30 Sepharose CL-6B™ (Phamiacia) column, equilibrated in Buffer A. The column was washed with 1 
liter of Buffer A, and then eluted with a 1 .5 liter gradient from 0 to 300 mM KCl/Buffer A. Fractions 
of 10 ml were collected, and aliquots removed from every third tube for SDS-PAGE analysis. 
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Fractions containing the 300kD band were pooled and dialyzed overnight against 2x4 liters of 
Buffer A. 

The heparin sepharose-purified pool was loaded at a flow rate of 0.5 ml/min. onto a 1.6 x 95 
cm («191 mis) Sephacryl S-200 High Resolution™ (Pharmacia) column equilibrated in Buffer A 
5 containing 100 mM KCI. Then, 2 ml fractions were collected and aliquots removed from every third 
tube for SDS-PAGE analysis. Fractions containing the 300kD band were pooled and dialyzed 
overnight against 1 liter of buffer containing 50 mM Tris-HCI (pH 8.2), 0,1 mM EDTA, 1 mM DTT, 
50% (v/v) glycerol, 0.1% (v/v) Igepal CA-(530, and 0.1% (v/v) Tween 20. The purified protein was 
stored at -20°C. The purification protocol described above yielded «1 mg, of relatively 
10 homogeneous P300 band from 500 g. of cell paste. 

3. Purification of PEF from SDS-PAGE gels 

PEF in a heterogeneous sample can be identified by eluting purified protein from SDS-PAGE 
J gel slices and rescreening for PGR enhancing activity. This method allows rapid assessment of the 

number of PEF proteins in a particular sample and identification of their apparent molecular weight. 
;j^5 P. furiosus protein samples with PGR enhancing activity were electrophoresed on 4-20% 

;V acrylamide/2.6% bis-acrylamide Tris-Glycine gels (Novex), along-side pre-stained molecular weight 

.y maricers. Samples were loaded in the presence of 2% SDS, but were not boiled in order to prevent 
5^1 dissociation of PEF complexes. The gels were mn in Tris-Glycine buffer containing 1% SDS, and 
;| after electrophoresis, were washed briefly in 20mM Tris-HGI (pH 8.0)/1mM EDTA. Then, 2-3mm gel 

go slices were excised from the lanes of interest using the pre-stained molecular weight mari<ers as a 
;3 guide. Each gel slice was cut-up with a razor blade and the pieces transfen^ed to 50//i of elution 

buffer (20mM Tris-HGI (pH 8.0)/1mM EDTA/0,1% Tween-20). The slurry was incubated at 72<'C for 

30 minutes. 

Gel slices containing PEFs are identified by testing the eluates for PGR enhancing activity. 
25 Eluates containing >0.1 ng/fA PEF are then re-analyzed on silver-stained SDS-PAGE gels to verify 
the apparent molecular weight of the predominant protein component. The gel slice eluates are 
boiled in the presence of 2% SDS before loading and the apparent molecular weights of PEF 
proteins detemriined relative to protein standards. The gel slice elution procedure described here 
allows recovery of 1-10% of the protein of interest. 
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Example 3 

Identification of the PCR Enhancing 
Activity in P. furiosus Partially-Purified Column Fractions 

The fractions collected after the heparin sepharose chromatography were analyzed for PEF 

5 activity using the PCR screening assay (Example 1). The addition of diluted heparin sepharose 

fraction dramatically increased yields of PCR products generated with cloned Pfu DNA polymerase. 

The PCR enhancing activity of the fractions was shown to be dependent upon the presence of 

exogenous DNA template (Figure 1) and Pfu DNA polymerase (Figure 2). Increased PCR product 

yield was, therefore, not due to the presence of contaminating DNA template or native Pfu DNA 

10 polymerase, but rather to the presence of PEFs. 

In order to further characterize the factor or factors responsible, the following was performed. 

PEFs after heparin sepharose chromatography were identified by screening SDS-PAGE gel-purified 

samples for PCR enhancing activity, as discussed above in Example 2. When the protein samples 

^1 were loaded onto SDS-PAGE gels without pre-boiling, PCR enhancing activity (Figure 3) was 

Ms recovered in 2 gel slices from the gel of Figure 4. One gel slice (gel slice #1) was excised from a 

;i position between the 42 and 60kD mari<ers, while the second gel slice (slice #2) was recovered from 

a site just above the 250kD maricer (Figure 4). 

y The proteins eluted from the gel slices were also screened for DNA polymerase activity to 

;„ demonstrate that PCR enhancing activity was not related to contaminating DNA polymerase activity 

lo (Figure 5). The results indicated that SDS-PAGE purified proteins with PCR enhancing activity lack 

-ff significant DNA polymerase activity. Moreover, SDS-PAGE purified Pfu DNA polymerase lacks PCR 

;3 enhancing activity when tested at protein concentrations comparable to or greater than those of gel- 

purified PEFs. 

The P. furiosus PEF proteins were conclusively identified by re-analyzing gel purified 
25 samples with PCR enhancing activity on silver stained SDS-PAGE gels. In the absence of pre- 
boiling, the predominant band in gel slice #2 migrates somewhat slower than the 250kD molecular 
weight mari<er. consistent with the site where the gel slice was recovered (Figure 6). This band is 
called P300 or PEF complex. In addition, a minor band is evident at 50kD, called P50. Gel slice #1 
contained a major band at 50kD and a minor band at 45kD. 
30 However, when the proteins eluted from gel slice #2 are boiled in SDS prior to loading, the 

predominant component migrates with an apparent molecular weight of 50kD (Figure 6). A minor or 
pooriy staining component of »45kD is also visible. These results are consistent with P. furiosus 
PEF consisting of two distinct proteins, with apparent molecular weights of 50kD and 45kD, which 
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aggregate at low temperatures in presence of SDS to produce a complex which migrates as a 
discrete band at 300kD. 

Example 4 

Characterization of PEF Complex and Protein Components of the Complex 

5 P. furiosus PEF fractions following S200 chromatography comprises a mixture of proteins. A 

discrete band in SDS-PAGE migrates above the 250kD marker when the sample is not heated prior 
to electrophoresis. This protein is called P300 or PEF complex. When the conditions are changed 
to substantially dissociate the PEF complex, the subunit protein components of the PEF complex are 
visualized in SDS-PAGE. 

10 1. Temperature*dependent Dissociation of PEF Complex 

„ One method of dissociating PEF complex into its component proteins is heat treatment. In 

g the absence of heating, the majority of PEF migrates as a complex, mnning slightly slower than the 

■5?: 

250kD molecular weight mari<er. Minor amounts of dissociated P50 and P45 are visible in the 
^ unheated sample. However, after heat treatment at temperatures of about 85**C or higher, the PEF 

complex is completely dissociated as indicated by the absence of the 300kD band by SDS-PAGE. 
4 The predominant protein component of PEF complex, as visualized by silver-staining, exhibits an 
^ apparent molecular weight of approximately 50kD in SDS-PAGE. The P50 band is shown in the gel 
a of Figure 7, where the protein was heated to >85*'C in the presence of 2% SDS and BME prior to 
I loading. In addition, the P45 protein shov/n to be present in the gel-purified PEF complex (Figure 6) 
20 is also evident in the heat-treated, S200-purified PEF sample (Figure 7). Furthemiore, minor 

components with apparent molecular weights of approximately 37, 42, 55, 60, 85, 100, and 150kD 

were also detected in SDS-PAGE analyses of PEF complex treated at temperatures of 85-100*^0. 

These minor components may represent additional fonns of P50 and P45 generated by heat 

treatment (e.g. dimers, trimers, fragments) or minor unrelated species. 

25 2. Polymerase Enhancing Activity of PEF Complex and 

Component Proteins and Mixtures 

Protein components of S200-purified PEF were purified by SDS-PAGE and the PGR 

enhancing titer was detennined by adding serial dilutions of each gel slice eluate to PGR reactions 

with cloned Pfu DNA polymerase PGR reactions. The protein or protein mixtures which exhibited the 

30 highest levels of polymerase enhancing activity were identified by mnning the eluates on silver 
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stained SDS-PAGE gels. Analysis of SDS-PAGE gel slice eluates indicates that PGR enhancing 
activity of S200-purified R furiosus PEF can be attributed solely to the 451cD species plus the 50kD 
species. When an S200-purified preparation was loaded in the absence of heating (Figure 7, prep. 
1), PGR enhancing activity was present in gel slices recovered just above the 250kD marker and 
between the 42 and 60kD markers. When heated to 85°C before loading, PGR enhancing activity 
migrated between the 42 and 60kD markers (Figure 8). 

Titration experiments showed that the PGR enhancing titer of gel purified proteins migrating 
with apparent mass between 42kD and 60kD was not significantly different from that of the gel- 
purified PEF complex (300kD band). The^ levels of PGR enhancing activity migrating between the 
148 and 60kD markers were insignificant, thereby indicating that the 85kD, lOOkD, and 150kD bands 
do not contribute substantially to full PGR enhancing activity of the PEF complex (P300). Moreover, 
these components do not appear to further enhance the activity of PEF. 

Figure 9 shows the proteins recovered from 4 gel slices between the 42 and 60kD markers 
from 2 heated treated PEF samples. The greatest PGR enhancements were observed for protein 
samples njn in lanes 5, 9. and 10. These lanes contained the highest amounts of P45, in addition to 
low but detectable amounts of P50. Relative P50 concentration did not necessarily con-elate with 
highest PGR enhancing titer, as the proteins mn in lane 4 (where only P50 is visible) exhibited a 
lower titer than the protein mixture in lane 5 (same amount of P50, plus P45). Moreover, samples in 
lanes 9 and 10 exhibited similarly high PGR enhancing titers and levels of P45. but the sample in 
lane 9 contained 10 to 1000-fold more PJ50 than the sample in lane 10. These results are consistent 
with P45 being the most active component of the PEF complex. Since all samples of gel purified 
P45 isolated contain varying concentrations of P50, the exact biochemical role a low concentration of 
P50 has on attaining full PEF activity or stability could not be determined by this method. 

Example 5 

Amino Acid Analysis of PEF Complex and P50 and P45 Components 

The complex and the predominate 50kD component (P50) and 45kD (P45) component from 
Pfu were sequenced at the N-temninus. In addition, N-terminal sequence analysis was perfonned on 
the minor 100 and 150kD components generated upon heat dissociation. Two analyses were 
performed. In the first study, heparin sepharose-purified PEF samples («20% homogeneous; prep. 
4 in Figure 10) were electrophoresed and electroblotted onto PDVF filters. Samples were loaded 
onto 4-20% SDS-PAGE gels at room temiperature or after heat treatment, to allow recovery of both 
the 50kD protein and the >250kD complex (Figure 11). Blots were sent to Wistar Sequencing 
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Facility (Philadelphia, PA) for analysis. N-terminal sequence analysis was perfonned on both the 
50kD (P50) protein (heated SDS-PAGE sample) and the >250kD PEF complex (unheated SDS- 
PAGE sample). 

The N-terminal amino acid sequences of the PEF complex (P300) and the 50kD component 
5 (Pfu P50) were found to be substantially identical (Table A). This data confirmed that Pfu P50 is the 
predominant component of the PEF complex. Two distinct sequences were found for both P300 and 
P50 (1° and 2**), suggesting that Pfu PEF may contain 2 different 50kD species which co-migrate, or 
that the PVDF strip containing the 50kD species was contaminated with the 45kD species or other 
species visible by SDS-PAGE. 
0 In addition to the N-temiinal sequencing, the 50kD protein was also subject to in situ trypsin 

digestion and microbore reverse HPLC. A subset of tryptic peptides was analyzed by mass spec. 
Two peptides with single masses (#107, #112) and one peptide with two masses (#108) were 
chosen for sequence analysis. Two internal peptide sequences from Pfu P50 were recovered (Table 
A; Tryptic Peptides). Peptide #112 was 24 amino acids in length and the calculated mass of the 
5 Edman sequence (2530.8) was in very good agreement with the observed peptide mass (2531.7). 
Peptides #107 and #108 contained multiple sequences which could not be sorted by Edman 
sequencing alone. However these peptide fractions eluted very close together on microbore HPLC 
and contained several residues in common. Based upon shared sequence and mass analysis, a 
tentative sequence was assigned (107/108; Table A). 
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Table A - Analysis 1. 
N-terminal Sequencing 



Protein 
Sample 



Mass (Da) 
Obs. 



PEF complex «50,000 



P50 



««50, 000 



Tryptic Peptides 



Peptide 



107 
108 

107/108 



112 



Mass (Da) 
Obs. 



1° 
2° 

1° 

2<> 



Calc. 



1389.59 N/A 
1659.1,1910.63 N/A 



Sequence/ (SEP ID NO: 



XLLHH VKLIY ATXXR 
XXXPD WXXRX EXLXX 



2531.73 



(1) 
(2) 



MLLHE VKLIY ATKSR (3) 
RLVGK KIVLA IPGXI AAVEP 
XXXPD WSXRX EXLGE KFY (4) 



Sequence/ (SEP ID NO: ) 



1910.3 KYDAV IMAAA WDFR PK (5) 



2530.8 ADLW GNTLE AFGSE (6) 
ENQW LIGR 



Comments 



multiple sequences 
multiple sequences 
AAs common to 
107/108 



"X" represents any amino acid 

underlined residues represent amino acids that may be deleted or substituted with any amino 
acid but are tentatively assigned as indicated 



The 35 amino acid sequence recovered from the N-temiinus of Pfu P50 (SEQ ID NO.: 3), 
and the two internal peptides of 17 and 24 amino acids (SEQ ID NO.:s 5 and 6), represent 
approximately 16% of the total amino acid sequence of Pfu P50, assuming an apparent molecular 
weight of 50kD and a length of approximately 454 amino acids. 

In the second round of analyses, the N-temiinal sequences of the 150, 100, 50, 45, and 42kD 
species were detemfiined from a PVDF blot of heated S200-purified PEF (Figure 12 and Table B). 
This analysis was perfomied by the Beckman Research Institute of the City of Hope (Duarte, CA). 
The N-temriinal sequences of the 150 and lOOkD species were the same and identical to the major 
sequence in the 50kD band (except for ambiguity at the N-terminus) and similar to the minor 
sequence in the 45kD band. In addition, the sequence was very similar to the major sequence in 
the PEF complex and the 50kD band detemnined in the Wistar sequence analysis. The data are 
consistent with the 150 and lOOkD species being alternative forms of the 50kD species (e.g., dimers, 
trimers, or aggregates). The major N-tenminal sequence of the 45kD band (2 blot sections analyzed; 
"upper" and "lower") was distinct from the P50 sequence and very similar to the minor sequence 
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found in the 50kD bands analyzed by both Wistar and Beckman and in the PEF complex analyzed 
by Wistar. No N-temninal sequence was recovered for the 42kD species. In total, these data are 
consistent with the PEF complex consisting of 2 distinct protein components. P50 and P45. 



Table B - Analysis 2. 



Protein san^lc 

(MW) 
ISOkD 
lOOkD 
50 

45 (upper) 

45 (lower) 
42 



1° 
no 



sequence 



Segucince 

(GAM)IiHHV KLIYA TKLRK (7) 

(GAM)LHHV KLIYA TK(KL)RK (8) 

M LHECV KLIYA TKL (9) 

GL(KIi)PD W(WK) (KF)RK EES (10) 

(GAI)LLPD WKIRK EILIE (11) 

XMHH{VI) KLXYA TXSRK (12) 
M(LY) (LV) (RP)D WKRRK EILIE (13) 



(SEQ ID NO) 



X represents any amino acid; 

underlined residues represent amino acids that may be substituted with any amino acid but 
are assigned as indicated; 

amino acids within parenthesis ( ) indicate that one of the enclosed amino acids is present at 
that site. 



Using sequence infomiation stored in a computer readable medium, one skilled in the art can 
perfomn computer-implemented homology searches. Here, the nonredundant GenBankCDS 
translations+PDB+SwissProt+SPupdate+PIR protein databases, using BLASTp, indicated that the 
partial amino acid sequence of Pfu P50 and P45 do not exhibit identity to any protein in those 
databases. 



Example 6 

Nucleotide and Predicted Amino Acid Sequence of P50 

The nucleotide sequence of the Pf50 protein component was obtained by cloning the Pfu P50 
using standard techniques. 



1. Library Screening 

A Pyrococcus furiosus genomic library was plated on XL1-Blue MRF' £ coli at a density of 
approximately 2000 plaques per plate. Duralose filters (nitrocellulose on a nylon backing) were used 
to take replicate lifts from each plate. While the first filter was on the plate, orientation marks were 
made by stabbing a needle through the filter and into the plate. The orientation marks were marked 
in pen on the back of the plate before the filter was removed. The filter lifts were treated as follows: 



-30- 



1.5-2.0 minutes 1.5 M NaCI, 0.5M NaOH 

2 minutes 0,5 M Tris (pH 8.0), 1.5 M NaCi 

30 seconds 2xSSC, 0.2M Tris (pH 7.5) 

After treatment, the filters were partially dried until they were still damp, but no standing water was 

visible. The DNA on the filters was fixed by UV crosslinking with the Stratalinker set to the "Autolink" 

format. 

The filters were prehybridized in 15 ml of: 
5x SSC 

40 mM NaP04 pH (6.5) 
5x Denhardt's 
5% Dextran Sulfate 
50% Formamide 

0.1 mg/ml Salmon sperm DNA (Boiled separately and added immediately prior to use) 
Prehybridization was canied out at 42*^0 for approximately 2 hours. 

Probe was generated from the 900bp PGR product amplified from Pfu genomic DNA with the 
following degenerate primers: 

Oligo #50: CAT CAT GAA AAA CTA ATT TAC GC (SEQ ID NO: 14) 

C C C G T C 
T T 



Oligo #61: GC CAT AAT TAC TGC ATC GTA TTT (SEQ ID NO: 15) 

G C G G 
T G A 
A 

Oligo #50 was designed to hybridize to DNA encoding the HHVKLIYA (SEQ ID NO.: 66) peptide in 
SEQ ID NO.: 1, at the N-temiinus of P50, while oligo #61 was designed to hybridize to the antisense 
DNA strand encoding the peptide KYDAVIMA (SEQ ID NO.: 67) in SEQ ID NO,: 5. 

The PGR product was purified from free primers, buffer and nucleotides and 50 ng was 
labeled with ^^P-adATP using the Stratagene Prime-It II Random Primer Labelling kit. The probe 
was purified from free nucleotides before being boiled for five minutes and added to the 
prehybridization reaction. The total probe was roughly calculated to be 80 million cpm. 

Hybridization was allowed to continue overnight at 42°C before the hybridization solution was 
removed and the filters were washed four times with 0.1x SSC, 0.1% SDS at 60*'C (stringent 
conditions). 

The filters were exposed to X-ray film overnight and 20 primary isolates, with strong signals 
on both replicate filters, were picked. 

Six primary isolates were diluted, plated, and screened again using the same method 
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described above. Of the six, three filters produced isolated lambda clones. The clones were 
confirmed by PGR amplification using the degenerate primers. All clones were able to produce the 
900 bp amplified product with oligos #50 and #61, which was used as probe. Clones 6A and 3B 
produced a 1200 bp amplified fragment with oligos #54 and #58. Oligo #54 was designed to 
5 hybridize to DNA encoding the HHVKLIYA (SEQ ID NO: 66) peptide in SEQ ID NO: 1, and oligo #58 
was designed to hybridize to the antisense DNA strand encoding the EENQWL (SEQ ID NO.: 68) 
peptide of SEQ ID NO.: 6. Clone 6D only produced a 900 bp amplified product. 

Oligo #54: CAT CAT GAA AAA CTA ATA TAC GC (SEQ ID NO: 16) 

C C C G T C 
10 T T 

Oligo #58: AG TACTAC TTG ATT TTC TTC (SEQ ID NO: 17) 

A G G C G 
A A 

f Bluescript plasmid was excised from the lambda clones in SOLR cells and the presence of inserts 

il5 confirmed again by PGR amplification of the 1200 or 900bp product. 

2. DNA Sequencing 

Sequencing was earned out on purified PGR products and plasmid mini-preps made from the 
excised cells. The nucleotide sequence is listed below with the predicted protein translation. The 
^ peptide sequences used to generate the probes are indicated by underiining. "N" represents any 

!|20 base and "X" represents any amino acid. 

1 (SEQ ID NO: 18) 

2 ATGCTTCACCACGTCAAGCTAATCTACGCCACAAAAAGTCGAAAGCTAGT 
TGGAAAAAAGATAGTCNNNNNNNNNCCAGGGAGTATTGCGGCTTTGGATG 
TGAAAGCTTGTGAGGGACTAATTAGGCATGGGGCCGAAGTTCATGCAGTG 

25 ATGAGTGAGGCAGCCACCAAGATAATTCATCCTTATGCATGGAATTTGCC 

CACGGGAAATCCAGTCATAACTGAGATCACTGGATTTATCGAGCATGTTG 

AGTTAGCAGGGGAACATGAGAATAAAGCAGATTTAATTTTGGTTTGTCCT 

G C C ACTG CCAACACAATTAGTAAGATTGCATGTGGAATAGATGATACTC C 

AGTAACTACAGTCGTGACCACAGCATTTCCCCACATTCCAATTATGATAG 
30 CCCCAGCAATGCATGAGACAATGTACAGGCATCCCATAGTAAGGGAGAAC 

ATTGAAAGGTTAAAGAAGCTTGGCGTTGAGTTTATAGGACCAAGAATTGA 

GGAGGGAAAGGCAAAAGTTGCAAGCATTGATGAAATAGTTTACAGAGTTA 

TTAAAAAGCTCCACAAAAAAACATTGGAAGGGAAGAGAGTCCTAGTAACG 

GCGGGAGCAACAAGAGAGTACATAGATCCAATAAGATTCATAACAAATGC 
3 5 GAG CAGTGGAAAAATGGGAGTAG CGTTGG CTGAAGAAG CAGATTTTAGAG 

GAGCTGTTACCCTCATAAGAACAAAGGGAAGTGTAAAGGCTTTTAGAATC 

AGAAAAATCAAATTGAAGGTTGAGACAGTGGAAGAAATGCTTTCAGCGAT 
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TGAAAATGAGTTGAGGAGTAAAAAGTATGACGTAGTTATTATGGCAGCTG 
CTGTAAGCGATTTTAGGCCAAAAATTAAAGCAGAGGGAAAAATTAAAAGC 
GGAAGATCAATAACGATAGAGCTCGTTCCNNNNAATCCCAAAATCATTGA 
TAGAATAAAGGAAATTCAACCAAATGTCTTTCTTGTTGGATTTAAAGCAG 
5 AAACTTCAAAAGAAAAGCTTATAGAAGAAGGTAAAAGGCAGATTGAGAGG 
GCCAAGGCTGACTTAGTCGTTGGTAACACATTGGAAGCCTTTGGAAGCGA 
GGAAAACCAAGTAGTATTAATTGGCAGAGATTTCACAAAAGAACTTCCAA 
AAATGAAAAAGAGAGAGTTAGCAGAGAGAATTTGGGATGAGATAGAGAAA 
TTNCTGTCC 

10 Pyrococcus furiosus P50 predicted ai'nino acid sequence: 

(SEQ ID NO: 19) 

MLHHVKLIYATKSRKLVGKKIVXXXPGSIAA LDVIC^CEGLI RHGAEVHAV 

MSEAATKIIHPYAWNLPTGNPVITEITGFIEHVELAGEHENKADLILVCP 

ATANTISKIACGIDDTPVTTWTTAFPHIPIMIAP.?UVIHETMyRHPIVREN 
15 lERLKKLGVEFIGPRIEEGKAKVASIDEIVYRVIKKLHKKTLEGKRVLVT 

AGATREYIDPIRFITNASSGKMGVALAEEADFRGAVTLIRTKGSVKAFRI 

RKIKLKVETVEEMLSAIENELRSK KYDWIMAAAViSDPRPK IKAEGKIKS 
;S GRSITIELVPXNPKIIDRIKEIQPNVFLVGFKAETSKEKLIEEGKRQIER 
,y AK ADLWQNTLEAFGSEENQWIiIgR DFTKELPKMKKRSLAERIWDEIEK 
20 XLS 

=^^= Translated sequence corresponding to chemically-determined N-terminal sequence (SEQ 

ID NO. : 3) : 

MLHHVKLIYATKSRKLVGKKIVXXXPGSLAA (SEQ ID NO: 46) 
Translated sequences corresponding chemically-determined internal peptide sequences 
S5 (SEQ ID NOs. : 5 and 6) : 

:1 KYDWIMAAAVSDFRPK (SEQ ID NO: 4 7) 

Z ADLWGNTLEAFGSEENQWLIGR (SEQ ID NO: 48) 

The protein has a theoretical pi of 9.36 and a theoretical MW of 44801.29. 

There are inconsistencies between the chemically-determined AA sequence of P50 and the 
30 AA sequence derived from the P. furiosus P50 genomic clone. One skilled in the art is familiar with 
many reasons for this type of inconsistency. For example, the inconsistencies below can, largely, be 
explained by known limitations common to the procedures used. These limitations do not operate to 
limit the stmctural knowledge of proteins or nucleic acids. Instead, they merely indicate possible 
variations in the sequences of amino acids or nucleic acids by a finite number. 
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Some of the inconsistencies and explanations are: 

MLLHHVKLIYA TKSRR LVGKK IVLAI PGXIA AVEP (Table A; SEQ ID NO:s 1 and 3) 
MLHHV KLIYA TKL (Table B; SEQ ID NO: 9) 

MLHHV KLIYA TKSRK LVGKK IVLAI PGSIA ALDV (predicted sequence from SEQ ID NO, :19) 

5 The inconsistency in Table A sequence at cycle 2 (extra AA between AA1 and 2) may be due to 
contamination with P45, which appears to have L's at positions 2 and 3, Moreover, L at cycle 2 in 
SEQ ID NO: 1 was assigned tentatively. Other inconsistencies between the Table A sequence and 
the predicted sequence occur at AA 15 (R vs. K) and AA 32-34 (VEP vs. LDV). 

An inconsistency between the Table B sequence and the predicted sequence was found at 
10 AA13. The identification of AA13 as L instead of S is explained by the poor recovery of S in 

chemical sequencing and the contamination of P50 with low amounts of P45, which has a L at that 
position. 

2 ADLW GNTLE AFGSE ENQW LIGR (Table A; SEQ ID NO: 6) 

y ADLW GNTLE AFGSE ENQW LIGR (predicted sequence from SEQ ID NO. : 19) 

f\5 KYDAV IMAAA WDFR PK (Table A; SEQ ID NO: 5) 

KYDW IMAAA VSDFR PK (predicted sequence from SEQ ID NO.: 19) 

3 SEQ ID NO: 6, detennined chemically from a P50 tryptic peptide, was identical to a 24 AA sequence 
S translated from the P50 DNA sequence. For SEQ ID NO: 5, there were 2 inconsistencies found 

g between the chemical and DNA sequences. An A was recovered at cycle 4 instead of a V, and a V 
lo was recovered at cycle 12 instead of a S. The inconsistencies may be due to the difficulties 
associated with interpreting sequences from a sample that is not absolutely pure. 

Example 7 

Identification of Proteins Related to P50: Similarity to E.coli DFP Flavoprotein 
1. DNA Sequence Similarity of Pfu P50 Protein to Archael and Bacterial Proteins 

25 The DNA sequence of a P50 clone exhibits very strong homology to the flavoprotein DFP, a 

protein identified in E. coli as playing a role in DNA synthesis and pantothenate metabolism (Spitzer 
and Weiss, J. Bacteriol. 164:994-1003 (1985) and Spitzer, Jimenez-Billini, and Weiss, J. Bacteriol. 
170:872-876 (1988)). Although DFP was found to be an essentia! gene for DNA replication, these 
authors were not able to elucidate its role in DNA replication. The sequences in Table 1 (N-terminai 

30 1° sequence, tryptic peptides 107/108 and 112) are all found in the translated P50 clone, which 
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exhibits very high similarity to DFP. Accordingly, the P50 amino acid and DNA sequence 
information can be used to identify related proteins associated with PEF complexes from other 
sources such as bacteria, eukaryotes, and archae. 

The amino acid sequences of Methanococcus jannaschii (Mja) and £ coli dfp proteins 
5 support the identification of the protein designated P50 (Pfu) as a member of the dfp family of 

proteins. The three protein sequences were compared using ClustalW 1.6, with the comparison data 
represented below. 

1 15 16 30 31 45 46 60 61 75 76 90 

^ _ 1 Pfu MLHH VKLIYATKSRKLVGK KIVXXXPGS lAAlDV - KACEGLIRHGABVH AVMSEAATKl IHP YA WNLPTGKPVITEITC 78 

2 Mja MISEIMHP TKLLKGTKSKLLENK KILVAVTSSIAAIET PKLMRELIRHGABVY CI ITEBTKKI IGKEA LKFGCGNBVYHEITC 83 

3 E.coli MKARQQKYCDKIANP WCHPTGKI IMSLAGK KIVLGVSGGIAA'i'KT PELVRRLRLRGADVR VAMTEAAKAPITPLS LQAVSGYPVSDSLLD 90 



15 



mo 



Page 2.1 

91 105 lOS 120 121 135 136 150 151 165 166 180 

1 Pfu PIBHVEIAGB HENKADLILVCPATA NTISKIACGIDDTPV TrWTTAPPHIPlMI APAMHBTOYRHPIVR ENIBRLK-KLGVEPI 162 

2 Mja DIEHILLY-- --NECDCLLIYPATA NIISKINIiGIAD^fIV NTTAI^PPGKKPIPI VPAMHENMPN--AIK RHIDKLKEKDKIYII 162 

3 E.coli PAAEAAMGHIELG-- - -KWADLVILAPATA DLIARVAAGMANDLV STICLATP- -APVAV LPAMNQQHYRAAATQ HNLBVLA-SRGLLIW 173 



Page 3.1 

181 195 196 210 211 225 226 240 241 255 256 270 

VQ-. 1 GPRIEE GKA KVASIDBIVYHVIKK LHKKTLE-GKRVI.VT AGATREYIDPIRPIT NASSGKMGVALAEEA DPRGAVTLIRTKGSV 245 

2 Mja SPKPEE GKA ICVANIEDWKAVIEK IGNNLKKEGNRVI.IL NGGTVEPIDKVRVIS NLSSGKMGVALAEAF CKEGPYVEVITAMGL 246 

"\ 3 E.coli GPDSGSQACGDIGPG RXXDPLTIVDMAVAH PSPVNDLKHLNIMIT AGPTREPLDPVRYIS NHSSGKMGPAIAAAA ARRGANVTLVSGPVS 263 



Page 4.1 

271 285 286 300 301 315 316 330 331 345 346 360 

1 Pfu KAPRIRKIKLKVETV BBMLSAIENELRSKK YDWIMAAAVSniTlP KIKABGKIKSGRS- - --1TIBLVPXNPKII DRIKBIQPN-VFLVG 330 

2 Mja EPPYYIICNHKVLTAK BHLNKAIE- -L-AKD FDIIISSAAISDiTV BS-FEGiaSSBBB- - --LILKLKR-NPKVL BELRRIYKD-KVIIG 326 

3 E.coli LPTPPFVKRVDVMTA LBMEAAVII--ASVQQ QNIPIGCAAVAD^-RA ATVAPBKIKKQATQG DBLTIKKVK-KPDIV AGVAALKDHRPYWG 350 

Page S.X 

361 375 376 390 391 405 406 420 421 435 436 4S0 

1 Pfu FKAETSK-BK-LIEE GKRQIERAKADLWG NTL SAPGSEBN QWLIGRDFTKELPK MKKRELABRIWDEIE KXLS 403 

2 Kja FKAEYNLDEKELINR AKBRLNXYKLNMI lA NDLSK — HYPGDDYI EVYIITKYEVBKISG SKK-EISBRIVEKVK KLVKS 403 

3 E.coli FAAETNN VEEY ARQKRIRKNLDLICA NDVSQPTQGPNSDNN ALHLFWQDGDICVLPL ERKBLLGOLLLDBIV TRYDEKNRR 430 



;^ From the above comparison, it would be apparent to one of skill in the art that related 

proteins from other species can be identified and isolated by methods known in the art. The 
example above employed stringent screening conditions. Less stringent conditions, varying the 

35 concentration of salts, detergent, or the temperature during hybridization or washing, as known in 

the art, would lead to related clones from libraries containing sequences of any of a number of 

species. For example, in addition to the conditions described above, any of the following 

hybridization conditions can be used, in any combination, in methods to isolate DNA sequences 

related to the P50 or P45 sequences herein: 

40 low stringency wash in a solution comprising approx. .45 M NaCI, approx. .045 M trisodium 

citrate, and approx. .1% SDS, at approx. 37^ to approx. 42*^0; 

hybridization buffer comprising approx. .75M NaCI, approx. .15 M Tris, approx. 10 mM 
sodium pyrophosphate, approx. .075 M trisodium citrate, and approx. 50% formamide; 
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hybridization buffer comprising approx. 5 x SSC, approx. 5x Denhardt's. approx. 5% Dextran 
Sulfate, approx. 50% fomnamide, and approx. .1 mg/ml ssDNA; 

hybridization wash comprising approx. .1 M phosphate, approx, .1 x SET, approx. .1% 
sodium pyrophosphate, and approx. .1% SDS at approx. 45**C. 

2. Absorbance Spectrum of Purified P. furiosus PEP Complex 

The absorbance spectrum of purified P. furiosus PEF complex reveals two peaks of 
absoriDance at 370 and 450 nm. Figure 13 depicts the absorbance spectmm of S-200 purified P. 
furiosus PEF. These data indicate and are consistent with the PEF complex comprising at least one 
flavoprotein. Sequencing data also verifies the identification of P. furiosus P50 as a homoiog of the 
£ coli DFP protein. E. coli DFP is a flavoprotein containing a non-covalently associated FMN 
moiety. 

Up to this point, flavoproteins have not been directly implicated as part of the replication 
machinery. The potential involvement of a flavoprotein in PGR enhancement suggests a role for 
redox reactions. The only redox reaction involved in DNA synthesis is the formation of 
deoxyribonucieotides from ribonucleotides, which is catalyzed by ribonucleoside diphosphate 
reductase. In vitro, the ribonucleoside diphosphate reductase enzyme can be coupled to NADPH via 
two known pathways involving FAD-containing oxidoreductases (Pigiet and Conley, J. Biol. Chem. 
252:6367-72 (1977); Theiander and Reichard, Ann. Rev, Biochem. 48:133-158 (1979)). One 
pathway involves thioredoxin and thioredoxin reductase. Interestingly, £ coli thioredoxin has been 
shown to act as an accessory protein and confer processivity on T7 DNA polymerase. An alternate 
hypothesis for the potential role of a flavoprotein in PGR enhancement is that a flavoprotein may be 
required for the continuous processing or activation of other proteins or cofactors needed for nucleic 
acid replication. 

Example 8 

Nucleotide and Amino Acid Sequence of P45 

The nucleotide sequence of the Pfu P45 protein component was obtained as described 

below. 

1. Synthesis of P45 Sequences 

Amino temriinal peptide sequencing of purified P45 protein allowed the generation of four 
degenerate oligonucleotides designed to hybridize to DNA encoding the PDWKIRKE (SEQ ID 
NO.:69) peptide of SEQ ID NO.: 11, as follows: 
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#743: CCA GAC TGG AA;^. ATA AGG AAA GA (SEQ ID NO: 32) 

T G TGG 

C 
T 

5 #744: CCA GAC TGG AA^. ATA AGA AAA GA (SEQ ID NO: 33) 

T G TGG 

C 
T 

#74 5: CCA GAC TGG AAPx ATA AGG AAG GA (SEQ ID NO: 34) 

10 T G TGG 

C 
T 

#746: CCA GAC TGG AAPx ATA AGA AAG GA (SEQ ID NO: 35) 

T G TGG 

15 C 

T 

A lambda phage P. furiosus genomic library was used as template for PGR amplification of 
3 the P45 sequence. The four degenerate oligonucleotides were used in separate reactions to prime 
S template in one direction, in combination with one of the primers that border the genomic insertion of 
^^0 the lambda vector (-20 primer and reverse primer). The PGR reaction was canied out as specified 
p below: 

" 10 //I 10x Stratagene cloned Pfu buffer 

5 //I degenerate P45 primer (either # 743, 744, 745 or 746) at 100 ng///l 

2.0 //I either reverse or -20 primer (100 ng///l) 
$5 0.8 //I 100 mM dNTP (total dNTPs) 

% 0.5 //I Taq DNA polymerase (Stratagene, 5u///l) 

f 0.5 //I Taq Extender (Stratagene, 5u///l) 

I 3.0 A/I Pfu genomic library ( - 1 .2 x 1 0^° plaque forming units/ml) 

30 PGR cycling was canied out in a RoboGradient temperature cycler as follows: One cycle at 95°G for 
3 minutes, followed by 30 cycles of: 95°G for 1 minute; 51*'G to SS'^G gradient for 2 minutes; 72°G for 
6 minutes. 

The PGR products were separated on a 1% agarose, 1x TBE gel. All primer combinations 
produced multiple bands. A pattern of four bands was consistently seen with primers 743, 744, and 
35 746 in conjunction with the -20 primer. The three degenerate primers that fomied consistent four 

band patterns with the -20 primer were able to generate the pattern at 56°C. Only primer 743 could 
generate the pattem at 58°G. The band pattem produced with the degenerate primers in 
combination with the reverse primers was less distinct and fomied only at lower annealing 
temperatures than the products generated with the -20 primer. 
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2. Cloning Strategies 

Two strategies were used to isolate the P45 clone. One procedure was to make simplified 
sub-libraries of the original highly complex library and screen for an insert with the -20 and 743 
primers. Positive sub-libraries could be diluted and rescreened until individual plaques containing 
the appropriate insert were identified. The other technique was to make use of Vectorette™ 
technology (Genosys Biotechnologies), which allows PGR amplification when the sequence of only 
one end of a DNA fragment is known. In the vectorette system, genomic DNA is digested with a 
selection of specific restriction endonucleases. After digestion, the ends of the genomic DNA are 
ligated to specific vectorette units, which have the same cohesive temiini as the genomic DNA 
digestion. The ligated vectorette unit contains a sequence complimentary to a provided vectorette 
PGR primer. (Arnold and Hodgson, PGR Methods and Applications 1: 39-42 (1991),) 

3. The Vectorette Reaction 

Fifty /j\ reactions containing 100 ng of P. furiosus genomic DNA were digested with Eco Rl, 
Hind III and Bsp 1061 (an isoschizomer for C/a I) in their recommended buffers for one hour at 37°G. 
Without any post-reaction treatment, 1 ij\ of the appropriate vectorette unit {Hind III, C/a I or Eco Rl 
at 3 pmole///l) was added with 6 //I of 10 mM ATP, 1 //I of 100 mM DTT and 1 unit of T4 DNA ligase 
(Stratagene 4 u//il). The reaction was cycled at the following temperatures: 20'*G for 60 minute 
followed by 37''G for 30 minutes for 3 cycles. 

The ligated DNA was amplified according to the following: 

10 A/I cloned lOx Stratagene Pfu buffer 

8.3 //I degenerate P45 primer at 100 ng//il 

2.0 fj\ 50 pmol///l vectorette primer 

0.8 //I 100 mM (total) dNTP 

0.5 //I Taq DNA polymerase (Stratagene, 5u///l) 

0.5 fj\ Taq Extender (Stratagene, 5u///l) 

1.0 //I vectorette library 

76.9 fj\ H2O 

PGR cycling was carried out as follows: One cylce at 95°G for 1 minute followed by 30 cycles of: 
95°G for 1 minute; 56°G for 2 minutes; and 72°G for 3 minutes. 

Ten //I were loaded on an 1% agarose, 1x TBE gel. Multiple bands were produced by all 
primers except 745. To detennine if all three vectorette library products had been correctly primed 
off the same target DNA (P45 sequence) rather than having been produced by a non-specific PGR 
reaction, the products were digested with Mnl I. Mnl 1 cleaves at a frequent four base pair 
recognition sequence and produces a useful pattern of bands specific to the template digested. The 
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pattern generated by electrophoresis of the Mnl I digestion fragments of the C/a 1/743, Hind 111/744 
and Eco RI/744 PGR products on a 6% acrylamide gel showed some variation, but the majority of 
bands could be identified in ail three samples, indicating that they share large segments of identical 
sequence. 

4, Screening 

The PGR products from the Cla 1/743 and Hind 111/744 combinations were mixed and purified 
from free nucleotides and unused primers before being used as template for the generation of a 52 
million cpm ^^P labeled probe. Details on probe synthesis and library screening are cited in Example 
6. 

More than 60 positive clones resulted from screening with the mixed vectorette probe. 
Several positive were well situated for collection without significant contamination from adjoining 
plaques. Twelve of these plaques were subjected to PGR amplification with the 743 and -20 primer 
as described previously except that an annealing temperature of 56^G was used instead of a 
temperature gradient. In the same amplification assay, 11 sub-libraries were assayed In the same 
manner. 

Three of the twelve clones recovered from the primary radioactive label screen produced 
strong, single bands. Glone 1 produced a band of approximately 5 kb, clone 3 produced a band of 
approximately 3,5 kb, and done 9 generated a band of approximately 2.7 kb. One of the sub- 
libraries also produced a clone of approximately 6.5 kb. 

5. Sequencing 

Sequencing of the P45 clones was carried out on purified PGR products and plasmid mini- 
preps made from excised cells. The nucleotide sequence of P45 is listed below with the predicted 
amino acid translation. The chemically-determined N-terminal sequence of P45 (SEQ ID NO.: 11), 
used to generate the degenerate PGR primers (SEQ ID NO.: 32-35), is indicated by underlining. 
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P45 nucleotide sequence 

ATGCTACTTC CAGACTGGAA 
TTCTGAAGAA TCGCTCCAAC 
AGGCTTTTGT TAAGGGGAAA 
5 GTTATTCCTC CAAGGGAATA 
GTTGCCCGAC GATGTTATGG 
GAGAAGGGGT TATTGGTTCT 
AACTTAACAC TAATGCTCTA 
ATATGGAGAG AGATTTGTGC 
10 CAAGAAACCC TTACAGAGGA 
TCAAAGAGAA AGAAACTCTA 

P45 amino acid sequence 

M LLPDWKIRK EILIE PFSEE 

VIPPREYALI LTLERIKLPD 
;J5 NLTLMLYNAS NEPVELRYGE 
CP SKRKKL* 

The translated P45 protein has a theoretical pi of 9.12 and a calculated molecular weight of 
,j 17868.76. The translated N-temninal sequence (underlined) of P45 con-esponds to the 

experimentally-detennined sequence (SEQ ID NO.: 11) and matches the exact sequence (SEQ ID 
1^ NO.: 60) used to design the degenerate PGR primers. 

O When the P45 DNA sequence is translated in all six frames and compared to multiple 

'^^ sequence databases using the computer-implemented program Blastx, the dCTP deaminase gene of 

Q Desulfurolobus ambivalens was found to have similarities. Another entirely different gene encoding 

polyprotein from Visna and MaedaA/isna vims was also identified, but at a less significant level of 
25 sequence similarity. Each of these similar genes, as well as those discussed below, may represent 
sequences related to P45. Thus, sequence infomriation stored on computer readable medium may 
be used with computer based methods to search for homologous, similar, or identical sequences in 
a database to develop further P45 and PEF-encoding DNA sequences in accordance with this 
invention, as shown in example 9 below. 



(SEQ ID NO.: 70) 

AATCAGAAAA GAAATACTTA TAGAGCCATT 
CAGCAGGTTA TGACCTCAGA GTGGGCAGAG 
TTAATCGACG TGGAAAAGGA AGGAAAAGTC 
CGCCTTAATC CTAACCCTCG AGAGGATAAA 
GGGATATGAA GATAAGGAGC AGTTTAGCAA 
TTTGCTTGGG TTGACCCAGG ATGGGATGGA 
CAATGCCTCA AATGAACCTG TCGAATTAAG 
AGATCGCATT TATAAGGCTA GAGGGTCCGG 
AACTATCAGG GGAGCACAAG GTTAGCGTTT 
G 

(SEQ ID NO,: 71) 

SLQPAGYDLR VGREAFVKGK LIDVEKEGKV 
DVMGDMKIRS SLAREGVIGS FAWVDPGWDG 
RFVQIAFIRL EGPARNPYRG NYQGSTRLAF 
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Example 9 

Identification of Proteins Related to P45 

1. DNA Sequence Homology of P. furiosus P45 Protein to dCTP Deaminase 
and dUTPases 

When the P45 DNA sequence wajs compared to multiple databases using the program 
BlastX, the probable deoxycytidine triphosphate deaminase (dCTP deaminase) gene (dcd) of 
Desulfurolobus ambivalens was found to exhibit the highest degree of similarity. The biochemical 
properties and physiological role of dCTP deaminase has been studied in E coli, where dCTP 
deaminase is a homotetramer, which catalyzes the fonnation of dUTP and NH3 from dCTP. DNA 
sequences with similarity to the E coli dcd gene have been discovered in other bacteria {Salmonella 
typhimurium, Haemophilus influenzae) and in archea (Ouzounis. C, Kyrpides, N., and Sander, C. 
NucL Acids Res. 23:565-570 (1995); Bult C.J. et al. Science 273:1058-1073 (1996); Beck, C.J., 
Eisenhardt, A.R. and Neuhard, J., J. BioL Chem. 250:609-616 (1975); Fieischmann, R.D. et ai, 
Science 269:496-512(1995)). Amino acid sequence comparisons between P, furiosus P45 and the 
dCTP deaminases from Desulfurolobus ambivalens CA")(also known as Acidianus ambivalens, 
Sulfolobus ambivalens), E coli CE'), and Haemophilus influenza are shown below, demonstrating 
sequence similarity ranges from about 39.1 to about 71.2%. 



MAP Multiple Sequence Alignment Results 
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1 15 16 30 31 45 46 60 61 75 


76 90 




1 p45 


MLLPDWKIIUtB ILIEPPSE-BWLQP AGYDLRVGXX XElVGR BAXVKGK--- -LIDVEK 


- --EGKVXIPPREYAL 


65 


2 A. 


MILGDRDLKYYLEKG WIVISPLTQ-DTIRE NGVDLRVGG- -BIAR FKKTDEIYEDGKDPR SPYBIEK 


- -GDEPI I YPNBHVL 


77 




MRLCDRDIEAWLDBG RLS INPRPPVBRING ATVDVRLGNKFRTFR GHTAAPIDLSGPKDB VSAALDRVMSDEIVL 


DBGEAFYLHPGELAL 


90 


4 HAEIN MRLCDTDIERYLDDG IISLTPRPNNDKIHG ATIDVmiGNSPR\TR EHSAPPIDLSGPKBB VSAQLESVMSDBIII 


PBGEAPPLHPGTLAL 


90 
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91 lOS 106 120 121 135 136 ISO 151 165 


166 180 




1 p4S 


ILTLERIKLPDDVMG DMKIRSSLAREGVIG SPAW- -VDPGWDG'.NL T1«LYKASNEPVBLR YGERFVQIAPIRLEG PARNPYR GNYO 


149 


2 A. 


LVTEEYVKLPNDVMA FVNLRSSPARLGLFV PPTl - -VDAGPEGiQL TIEVLG-SAPPVKIX RGTRPLHLIPARTLT 


PVENPYH GKYQ 


160 


3 E. 


AVTLSSVTLPADLVG WLDGRSSLARLGLMV HVTAHRIDPGWSGrCI VLEFYNSGKLPLALR PGMLIGALSFEPLSG 


PAVRPYNRRSDAKYR 


180 


4 HAEIK 


ATTLESVKLPANIIG WLDGRSSLARLGLMV HVTAHRIDPGWBGJKI VLEFYNSGKLPLALR PNMVIGALSFEVLS6 


EXKRPYSSRKDAKYK 


180 
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181 195 196 210 211 ;:25 226 240 241 255 


256 270 




1 p4 5 


GSTRLAPSKRKKL-- 162 






2 A. 


GQQGVTLPKPKPR-- 173 






3 E. 


NQQGAVASRIDKD-- 193 






4 HAEIN NQQSAVASRIDEDKB 195 







E. coli DCD exhibits an apparent molecular weight of 21.2kD (Wang, L. and Weiss, B. J. 
BacterioL 174:5647-5653 (1992)), while the predicted molecular weight of MJannaschii DCD is 
approximately 22kD (204 amino acids). These molecular weights are approximately half the 
apparent molecular weight of P. furiosus P45 and suggest that the heat-dissociated forni of P45 
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(apparent mol. weight of 45kD) may actually be a dimer. 

In addition to dCTP deaminase, P. funosus P45 exhibits a significant, but lower degree of sequence 
similarity to uridine triphosphatase (dUTPase). dUTPase. an enzyme encoded by the dut gene, 
converts dUTP to pyrophosphate and dUMP. An amino sequence alignment comparing the 
sequence of P45 to several dUTPases is shown below. Regions of identity are shown in black 
boxes, while regions of similarity are shown in grey boxes. 
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One of the regions of sequence similarity between P45 and dUTPase is the putative uridine- 
binding motif. This motif is conserved in the translated amino acid sequence of P. furiosus P45, 
which reveals the presence of a putative uridine-binding sequence conserved in i/ synthetases, 
dCTP deaminases, and dUTPases (Koonin, E.V. NucL Acids Res. 24:2411-2415 (1996)). In the 
5 following comparisons, U represents a bulky hydrophobic residue such as I, L, V, M, F, Y, or W. and 



the bolded residues match the G, D or U residues of the consensus sequence. 

Consensus uridine-binding 





motif : 




.GUUID. .U.G.U.U 


(SEQ 


ID 


NO. : 


72) 




P. fur. P45: 




FAWVJD PGWDGNTLM 


(SEQ 


ID 


NO. : 


73) 


10 


M, jann. DCD 




AGWI]DAGFKGKITL 


(SEQ 


ID 


NO. : 


74) 




M. jann put. 


DCD: 


SAVHDPGYEGRPEY 


(SEQ 


ID 


NO. : 


75) 




D. sulf. DCD 




PTIVIDAGFEGQLTI 


(SEQ 


ID 


NO. : 


76) 




E. coli DCD: 




AHRIDPGWSGCIVL 


(SEQ 


ID 


NO. : 


77) 




£. coli DUT: 




VGLIDSDYQGQLMI 


(SEQ 


ID 


NO. : 


78) 




Yeast DUT: 




AGWIDRDYTGEVKV 


(SEQ 


ID 


NO. : 


79) 




Human DUT: 




AGVIDEDYRGNVGV 


(SEQ 


ID 


NO. : 


80) 




Herpes virus 


DUT: 


TGLIDPGFQGELKL 


(SEQ 


ID 


NO. : 


81) 



Each of these proteins represent activities, such as dUTPase, that may be used as a 
polymerase enhancing activity or PEF. One skilled in the art can identify numerous other proteins 
120 using stored sequence infomnatlon, in the appropriate computer readable medium, from this 
==5 disclosure and analogous searching procedures in other databases. Database infomriatlon on each 
4 of the following species can specifically be used to identify PEF using one or more of the 
% sequences, or parts thereof, identified herein: Pyrococcus furiosus; Pyrococcus horikoshii; 

Pyrobaculum aerophilum; Sulfolobus solfatahcus; Archaeoglobus fulgidus; Aquifex pyrophilus; 
25 Methanococcus jannaschii; Thermotoga maritima; Methanobacterium thermoautotrophicum; and 
Thermus thenvophilis. 

The physiological function of dCTP deaminase has only been studied in £. coli, where it 
plays an essential role in deoxyribonucleotide metabolism. dCTP deaminase converts dCTP to 
dUTP, which is-^n obligatory step in the de novo synthesis of thymidylate in bacteria (Taylor, A.F., 
30 and Weiss, B. J. Bacteriol. 151:351-357 (1982)). In turn, uridine triphosphatase (dUTPase encoded 
by the dut gene), a ubiquitous enzyme found in bacteria, eukaryotes, and eukaryotic vimses, 
degrades dUTP to pyrophosphate and dUMP, the thymidylate synthetase substrate. Thus, dCTP 
deaminase and dUTPase are functionally linked, with mutations in the dcd gene suppressing dut 
mutations (Wang, L. and Weiss. B. J. Bacteriol. 174: 5647-5653 (1992)). 



-44- 



dUTPase has shown to be an essential gene in £ coli and in yeast (El-Hajj, H.H,, Zhang, H.. 
and Weiss, B. J. Bacteriol. 170: 1069-1075 (1988); Gadsden, M.H., et al. EMBO J. 12:4425^431 
(1993)) because it functions in vivo to prevent dUTP incorporation into DNA. In E. coli dut mutants, 
the dUTP pool is greatly enhanced, resulting in an increased substitution of uracil for thymine in 
5 DNA during replication. Uracil-DNA glycosylase and exonuclease III play an essential role in 
repairing uracil-containing DNA in £ coli dut mutants (Taylor, A.F. and Weiss, B., J. Bacteriol. 
151:351-357 (1982)). 

The substrate specificities, enzyme activities, and physiological role of dCTP deaminase and 
dUTPase had not been characterized in archea prior to this disclosure, 

10 Example 10 

Expression Cloning and Characterization of Recombinant P45 
1. Expression Cloning of P45 by the Method of Ligation Independent Cloning (LIC) 

Recombinant P45 was produced by PGR amplification of a portion of a P45 genomic clone 
(clone #9). The primers (oligos # 1 and 2 below) were designed to function with the Affinity Protein 
15 Expression and Purification System (Stratagene; La Jolla, OA), which uses Ligation Independent 
Cloning (LIC). 

Oligo # 1 . 5' GACGACGACAAGATGCTACTTCCAGACTGGAAA 3' (SEQ ID NO: 82) 
OligO # 2. 5' GGAACAAGACCCGTCCCACTTTCACAGATGAAGAG 3' (SEQ ID NO: 83) 

The bold letter segments represent sequences specific to the cloning vector while the 
20 adjoining sequence is specific to the clone #9 sequence. The ATG underiined in oligo #1 

corresponds precisely to the 5' end of the P45 gene, while the sequence after the bold letters in 

oligo #2 con-esponds to the end of the genomic insert. 

The PGR amplification was canied out in a volume of 100 //I containing: 1x Cloned Pfu 

Polymerase Buffer; 0.2 mM dNTPs (each); 200 ng of Primer oligo #1; 200 ng of Primer oligo #2; 3 //I 
25 Genomic clone #9 plaque core in 500 //I SM buffer (--2000 Lambda particles); 2.5 units Cloned Pfu 

DNA Polymerase; and 7 ng Native PEP (where 10x Cloned Pfu Polymerase Buffer is: 100 mM KCI; 

100 mM (NHJ2SO4; 200 mM Tris-CI (pH 875); 20 mM MgS04; 1% Triton® X-100; and 1000//g/ml 

BSA). 

The thermocycling parameters were: 95*'C for 1 minute (1 cycle); dS^'C for 1 minute - 56°C 
30 for 1 minute - 72°C for 5 minutes (30 cycles). 
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The 2.5kb amplified product was purified and then subjected to limited nucleotide excision in 
the presence of dATP. This protocol promotes removal of nucleotides at the 3' termini of the PGR 
product until an adenine residue is reached. The excision mixture (consisting of: 1x Cloned Pfu 
Polymerase Buffer; 0.5 mM dATP; 43.5 pi PGR product (8.7 ng///l); 1.25 units Gloned Pfu DNA 
polymerase) was incubated at 72°G for 10 minutes. 

20 //I of the exonuclease treated PGR product was annealed with 40 ng of digested pGAL-n- 
EK vector (exonuclease treated to produce ends complimentary to the sequence exposed in the 
PGR product) for 45 minutes at room temperature. The amount of insert molar excess, relative to 
vector, was approximately 9 fold. The pCAL-n-EK vector contains an upstream, in-frame calmodulin 
binding peptide tag, which allows the N-terminal fusion protein to be easily purified on calmodulin 
agarose (GAM agarose). Various other expression vectors, which may or may not produce fusion 
proteins, are known in the art and can be used to express P45 protein or fragments thereof or to 
produce DNA constructs with a sequence encoding P45 protein or fragments thereof An expression 
vector need only contain DNA sequences operating to pemiit or control transcription from an 
appropriately linked nucleic acid. The type of control, the degree of transcription pemriltted, and the 
manor in which the vector and nucleic acid are appropriately linked may vary. Generally, an 
expression vector also contains a replication control sequence to allow the vector to replicate in a 
host. However, replication control sequences are not required where replication of the host is not 
crucial to expression. 

2. Purification of Recombinant 1P45 

Five microliters of the annealed vector/insert DNA was transformed into XL2-Blue competent 
cells. Ten of the resultant colonies were screened by PGR for the 2.5 kb insert and 9 were found to 
contain the conrect size insert. Plasmid DNA was prepared from three clones and then used to 
transform BL21(DE3) cells. Six BL21(DE3) colonies were grown for approximately 10 hours in 5 ml 
of 1x LB, 125 //g/ml ampicillin at 37^G. These cultures were used to inoculate six flasks containing 
250 ml 1x LB and 125 //g/ml ampicillin. When the optical density (ODgoo values) of the cultures 
reached 1.1 to ,13, IPTG was added to a final concentration of about 1 mM. The cells were grown 
overnight at 37''G. Gells were collected by centrifugation and the pellet estimated (visually) to be in 
the range 0.4 - 0.5 gm. The pellet was resuspended in 2.5 ml of calcium binding buffer: 50 mM 
Tris-HGI (pH 8.0); 150 mM NaGI; 10 mM 0-ME\ 10 mM magnesium acetate; 2 mM GaGlj. 

250 A/I of 10 mg/ml lysozyme was added to the cells and the reaction was allowed to incubate 
on ice for one hour. The slightly lysed samples were sonicated twice with the Branson Sonifier 250, 
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the microtip at a duty cycle of 50% and a setting of 4. The lysate was cleared by superspeed 
centrifugation. Cleared lysate was added to 700 //I of washed Stratagene Calmodulin agarose (50% 
CAM agarose by volume) and allowed to bind with gentle rocking at 4*'C for 1 hour. The resin was 
washed 3 times with 10 ml of CaCIa binding buffer and twice with 5 ml of the same solution except 
that the CaClz concentration was reduced to 0.1 mM. Washing was accomplished by centrifugation, 
removal of the supematant, and resuspension in fresh buffer. 5 |/l of the CAM resin was collected to 
examine on SDS-PAGE electrophoresis. The washed matrix was resuspended in 900 //I of elution 
buffer [50 mM Tris-HCI (pH 8.0); 150 mM NaCI; 10 mM j9-ME; 2 mM EGTA] and allowed to sit for 
one minute prior to pelleting of the agarose resin. The elution buffer containing P45 was removed to 
a separate tube and the CAM agarose was resuspended again in 900 //I of elution buffer. Next, a 
high salt elution buffer [50 mM Tris-HCI (pH 8.0); 1000 mM NaCI; 10 mM jff-ME; 10 mM magnesium 
acetate; 2 mM EGTA] was used to elute remaining P45 protein from the CAM agarose: 

Subsequent SDS-PAGE analyses showed that the high salt elution buffer released a majority 
of the recombinant P, furiosus P45 protein from the column matrices. Some residual protein 
remained bound to the calmodulin agarose. 

The method described here to produce P45 protein can be modified in numerous ways by 
methods known in the art. (Ausubel, P.M., et al. (1989) Cun^ent Protocols in Molecular Biology, 
Greene Publishing Associates and Wiley-lnterscience, New York, NY; Sambrook, J., et al. (1989) 
Molecular Cloning: A Laboratory Manual, 2nd Ed., Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor, N.Y.) In one possible modification, a P45 analog protein can be produced. For example, a 
mutation can be introduced into the P45 coding region. Any type of mutation can be used including 
site-specific point mutation, deletion mutation, insertion mutation, and multiples or combinations of 
these mutations. This mutant coding region is inserted into an appropriate vector, which is 
transferred into a host cell. The host cell then expresses the P45 analog. A P45 analog protein 
substantially retains one or more of the PEF activity or dUTP or dCTP metabolic activities described 
herein. Thus, the fusion protein and affinity tag expression and purification system described here is 
only one of many ways to produce a recombinant PEF protein such as recombinant P45. 

3. Comparison of Recombirtant P45 and Native PEF PGR Enhancing Activity 

The 5.2kb "On/Off" assay, described in example 1, was used to demonstrate that 
recombinant P45 (rP45) possesses PEF activity comparable to a natural PEF. The results are 
shown in Figure 14. When no PEF activity is added, the 5.2 kb amplification product is not 
generated as shown in the lane labeled "none" of Figure 14. When proteins produced from the 
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recombinant clones of P45 (called "1" or "23" from their plasmid delineation) were added to the 
On/Off assay in amounts of 5//I, 1 //I, or 1I//I of a 1/10 dilution (approximately 100, 20 and 2 ng, 
respectively), they all showed enhancement of PGR product yield comparable to the two native PEF 
preps, interestingly, 5 //I worked less well than 1//I. An "overdose" phenomenon was also noted 
5 with native PEF (example 16). 

4. Molecular Weight of Recombinant P45 Compared to Native PEF Complex and 
Protein Components of tlie Complex 

The predicted molecular weight of the translated rP45 DNA sequence was 18.6 kDa. 

However, the native P45 component migrates as part of the PEF complex at 300 kDa without heat 
10 denaturation, and at 45 kDa with heat denaturation (99°C for 5 minutes, partially denatured). Protein 

complexes in hyperthemnophiles are unusually stable and sometimes require extreme conditions 

before disassociation occurs. We have found that P45 migrates at approximately 18kD, or 
% approximately 17-18kD, instead of 45kD when the native PEF complex is boiled in the presence of 

Cm 2% SDS and 1 % trichloroacetic acid (TCA) (fully denaturing conditions)(Figure 15). Without heat 

ilj5 denaturation, more than half of the recombinant P45 migrated at approximately 50 kDa (45 + 4 kDa 

calmodulin binding tag), while the remaining sample migrated close to 26 kDa (Figure 15). With 
u| heat denaturation all of the sample migrated at approximately 26 kDa, suggesting that the 

recombinant is a dimer in its undenatured state. Trichloroacetic acid heat treatment failed to 
m produce any additional bands in the recombinant sample. 

- jo The migration differences between the fully denatured fomns of the native (18kD) and 

A recombinant P45 (26kD) might be explained by differences in post-translational modifications 

^3 between P. furiosus and the £ coli host or by the effects of the 4kD GBP (calmodulin binding 

protein) tag on the folding or migration of the rP45 fusion protein. 

Example 11 

25 Potential Mechanisms of PGR Enhancement by P45 

1. Identification of the Reaction Catalyzed by PEF/P45 

The siririilarity of P45 to dCTP deaminases prompted us to test whether dCTP or other 
nucleotide triphosphates were substrates of PEF/P45. PEF was incubated with dCTP or dUTP and 
the reaction products were separated by reverse phase HPLC. Reaction mixtures (50^/1) containing 
30 1x cloned Pfu polymerase buffer, lOmM dCTP or dUTP, and 700ng PEF (or 5//I of cloned p45 (-100 
ng) or nothing (neg. control)) were incubated at 72°C for 1 hour. The samples were injected on a 
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3.9 X 250 cm Waters Delta-pak C-18 column (300 angstrom/1 5//m), equilibrated in 50mM 
triethylammonium acetate, pH(7.0) (A). Products were eluted with acetonitrile (B) at 2 ml/min. using 
the following gradient: 0% B for 5 minutes, 0-10% B over 20 minutes. Absorbence of the eluate was 
monitored with a photodiode an-ay detector, and peak areas were integrated at 260nm. 

The data in Figure 16 show that both the native PEF complex and purified recombinant p45 
utilize dUTP as a substrate. In the presence of cloned Pfu DNA polymerase PGR buffer, dUTP 
migrates at 10.517 minutes. After a one hour incubation with PEF or recombinant P45 (purified from 
clone 1 or 23), the dUTP peak disappeared (0.1% total peak area) and a new peak appeared eluting 
at 4.400 minutes (98% total peak area). The product of the PEF + dUTP reaction migrated with the 
same retention time as a dUMP standard, which was different from the retention times of dUDP, 
dCMP, dCDP, and dCTP. To confirm that the product was dUMP, a dUMP standard and the PEF + 
dUTP product were mixed together and re-injected. The mixed sample produced only one peak. 

In addition to dUTP, dCTP was also found to serve as a substrate for PEF (Figure 17). The 
product of the PEF + dCTP reaction appeared to be dCMP, based upon its retention time and 
absoriDence maxima. Although PEF utilizes dCTP, dUTP is the prefen-ed substrate of PEF/P45. In 
the example shown, only 16.6% of the dCTP was converted in 1 hour under the same conditions 
which converted 100% dUTP (Figure 17, panel B). Moreover, when dCTP and dUTP were 
incubated together with PEF, only dUMP (4.2 minute peak in panel C) was generated. No reactivity 
with PEF was detected for the following nucleotides: dGTP, dATP, dTTP, dCMP, dUMP, and rUTP 
under the conditions used. 

The substrate preference and reaction catalyzed by PEF/P45 was significantly different from 
that predicted based upon the amino acid similarity to dCTP deaminases. Although both enzymes 
bind dCTP and dUTP, the forward reaction catalyzed by dCTP deaminase is the deamination of 
dCTP to produce dUTP + NH3. PEF/P45, on the other hand, preferentially utilizes dUTP, and 
catalyzes the release of pyrophosphate (PPi). No deamination of dCTP by P. furiosus PEF or P45 
was observed under the conditions used. 

2. Enzymatic Detection of Inorganic Pyrophosphate (PP,) Produced by PEF 
from dUTP 

We tested whether inorganic pyrophosphate (PPj) or inorganic phosphate (P) was generated 
during the PEF-catalyzed conversion of dUTP to dUMP (Figure 16). To assess whether P; was 
produced, 900 or 1800 ng of PEF was incubated in the presence of 5 mM dUTP in 1x cloned Pfu 
polymerase buffer for 1 hour at 72°C. The reaction mixture was subsequently analyzed for the 
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presence of using the method of Heinonen and Lahti (Heinonen, J.K. and Lahti, RJ. (1981) Anal. 
Biochem. 113: 313-317). Previous results (HPLC) demonstrated that under the above reaction 
conditions, 100% of the dUTP should be converted to product. One hundred percent (100%) 
conversion corresponds to the production of 500 nmol of Pj, if Pj is a product of the reaction. 
However, no Pj was detected in these assays. The assay for Pj was sensitive enough to detect as 
little as about 50 nmoles of Pj. These observations show that Pj is not fomned during the reaction of 
PEF with dUTP. 

In experiments to investigate whether PP; is formed during the reaction, 900 or 1800 ng of 
PEF was incubated with 10 mM dUTP in 1x cloned Pfu polymerase buffer for 1 hour at 72°C. The 
presence of PPj in the reaction mixture was then quantified using Sigma's "Enzymatic Detemiination 
of Pyrophosphate" kit (Sigma Product No. P7275). The kit utilizes a coupled enzyme system 
whereby two moles of NADH are oxidized to NAD* for each mole of PPj present in the reaction 
mixture. The oxidation of NADH is monitored spectrophotometrically at 340 nm. Using this assay 
system, the production of PPj was cleariy established in reactions that contained PEF and dUTP. 
No PPj was detected in control reactions that lacked PEF or that contained PEF and dATP (in place 
of dUTP). The reactions that contained 1800 ng of PEF produced twice as much PPj as those that 
contained 900 ng of PEF. 

The temperature optimum (T^p^) for the generation of PPi from dUTP was measured by 
incubating mixtures of dUTP and native PEF at temperatures ranging from 73°C to 99°C. PPi 
production by native PEF increased steadily over this temperature range and was highest at 99*^0, 
While the T^pt for native PEF was found to >99°C, recombinant P45 exhibited maximal activity 
between SS^'C and 93°C when tested at the same enzyme concentration (dUTP turn-over), but at a 
lower total protein concentration. Accordingly, specific temperature ranges, such as from about 
70^C to about lOO^C, from about SS^'C to about 93^C, or temperatures above about 70*'C, can be 
used with a PEF or polymerase enhancing activity of the invention. 

The enzyme activity of three different preparations of S200-purified native PEF was 
measured at 85*C. Protein concentrations were detemiined by both Bradford and by amino acid 
analysis. Shown below is a summary of the enzyme activity and specific activity (protein 
concentration determined by Bradford or AAA as indicated) of S200-purified native PEF. These 
activites were compared to the minimum amount of purified PEF required to amplify the 5.2kb target 
in the "on/off' assay described in example 1 (1 00)1/1 PGR), 



-50- 



Prep 



PPi production 

ijjmole 

PPi/hr///l) 



1 
2 
3 



1.23 
0.27 
0.14 



Specific activity 
Oi/mole 
PPi/hr///g) 
(Bradford) 



1.76 
0.59 
na 



Specific activity 
(jjmole 
PPi/hr/A/g) 
(AAA) 



4.03 
3.13 
1.37 



Minimum 
amount of 
dUTPase 
required for 
5.2kb PGR 
(nmole PPi/hr) 

0.11 
0.11 

0.22-0.44 



The data indicate that for purified PEF preps 1 and 2, there is an excellent agreement 
between dUTPase activity and PGR enhancing activity. However, native PEF prep 3 exhibited 2-4 
times less PGR enhancing activity, possibly due to its lower specific activity. Prep 3 may contain 
contaminants, which interfere with PGR enhancement. 

PPi fonnation from dCTP was also measured by substituting dGTP for dUTP. For native PEF 
prep 1, the level of dGTPase activity was found to 0.097(//mole PPi/hr///l) at 85°G, which is 12-fold 
lower than the rate of PPi production from dUTP. In addition, recombinant P45 preparations were 
also tested and found to produce PPi from both dUTP and dGTP. Native PEF and recombinant P45 
exhibited a similar degree of preference for dUTP, as compared to dGTP. 

Therefore, recombinant P45 and structurally similar P45 proteins catalyze this reaction in the 
absence of any of the other components of the PEF complex. The methods described here for the 
PPj detection are, thus, a useful tool for analyzing the activity of or detemnining the presence of 
native PEF, PEF analog proteins, recombinant or synthetic PEF proteins, PEF complexes, and rP45. 

3. PEF/dUTP Product Characterization with Mass Spectrometry 

Electrospray mass spectral analysis was perfomned to characterize the byproduct of reactions 
employing PEF and dUTP. Analysis in the negative ionization mode produced a relatively clean 
spectrum exhibiting peaks at 307 and 615 m/e. These masses are consistent with those of dUMP 
[M-H]' and its non-covalent dimer [2M-H]'. Analysis in the positive mode gave a complex array of 
peaks. 
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4. Role of PEF/P45 in Eliminating dUTP Accumulation During PCR 

An understanding of the catalytic activity of PEF/P45 has provided insight into the mechanism 
by which PCR enhancement occurs. Lasken et ai have reported that archeal DNA polymerases, 
such as Vent, incorporate dUTP at approximately 40% the rate of TTP (Roger S. Lasken, David M. 
5 Schuster, and Ayoub Rashtchian, (1996) J. Bioi Chem. 271; 17692-17696). However, further DNA 
synthesis by archeal DNA polymerases appears to be inhibited by dU-containing DNA. Inhibition 
appears related to the 6500-fold greater affinity of Vent for dU-containing sites, as compared to dT- 
containing DNA. Based upon these observations, Lasken has proposed that archeal DNA 
polymerases may play a role in repairing dU-containing DNA in vivo, 
10 One possible mechanism for the PCR enhancing activity of PEF/P45 is that its associated 

dUTPase activity may convert any dUTP present during PCR to dUMP, a side-product which should 
not interfere with DNA polymerase activity. By so doing, dUTP would not be available for 
incorporation into the PCR product, and hence dU-DNA inhibition of the DNA polymerase would not 

:.g occur. Such a mechanism is consistent with the increased PCR product yields generated by archeal 

i=^il5 DNA polymerase in the presence of PEF/P45. 

dUTP, however, is not intentionally added to PCR reactions but may be generated by 

: spontaneous deamination of dCTP during the high temperatures used in PCR cycling (Jens-Peter 

y Horst and Hans-Joachim Fritz, (1996) ne EMBO Journal 15; 5459-5469). To assess the extent of 

dCTP deamination to dUTP during PCR, we incubated dCTP (10mM) at 95**C (PCR denaturation 

i:|20 temperature) in the presence of cloned Pfu DNA polymerase PCR buffer. The products of heat 

treatment were analyzed by reverse phase HPLC (as described above) and peak assignments were 

eg made by comparing retention times to those of standards. Figure 18 shows the results obtained 

when dCTP is heated for 1 hour (panel B) or 4 hours (panel C) at 95°C. dCTP breaks down into 2 
predominate species, dCDP (22%/1 hr; 41%/4hr.) and dCMP (2.8%/1 hr.; 25%/4 hr). A minor peak 
25 at 12.38 minutes was also produced (.049%/1 hr.; .116%/4 hr.), which was identified as dUTP, 

based upon its retention time (dUTP standard = 12.10 minutes; panel D) and absorbence maximum. 
As shown in Figure 19, the 12.38 minute peak generated by heating dCTP (spectaim 2) exhibits 
maximal absorbence at approximately 260nm, as does the dUTP standard (spectmm 3), while dCTP 
absortDS maximally at approximately 270nm (spectmm 1), 
30 dUTP production was also measured during PCR cycling. A dCTP solution (10mM dCTP in 

cloned Pfu DNA polymerase PCR buffer) was subject to PCR cycling in a RoboCycler 40 using the 
cycling conditions described for the 5.2kb "on/ofP' system (example 1). Products generated during 
PCR were analyzed by HPLC as described above. After 30 cycles, the following products were 
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evident: 79% dCTP, 19% dCDP, 1.8% dCMP, and .064% dUTP (Figure 20, panel A). The dUTP 
generated during PGR cycling of dCTP exhibited a retention time (12.1 min.) and absorbence 
maxima (263nm) which were very close to those of the dUTP standard (11.9 min.; 263nm) (Figure 
20, panel D). When dCTP was cycled in the presence of Pfu DNA polymerase (SOX PGR 
conditions, 10mM dGTP and 1.25U Pfu///I reaction mix) there was no difference in the amount of 
dGDP, dGMP, or dUTP produced (Figure 20, panel G). However, when dGTP was cycled in the 
presence of PEF (SOX PGR conditions, lOmM dGTP and O.Sng PEF///I reaction mix), the following 
products were produced: 73% dGTP, 19% dGDP, and 8% dGMP (Figure 20. panel B). There was 
no detectable dUTP generated when dGTP was cycled In the presence of PEF, consistent with an 
associated dUTPase activity of PEF. 

The increase in dGMP production in dGTP + PEF samples (8%) as compared to dGTP + Pfu 
samples (1.7-1.8%) shows that in addition to eliminating the minor dUTP deamination product during 
PGR, PEF will also convert dGTP to dGMP. In this experiment (50x PGR conditions), the final dCTP 
concentration post-PGR was 73% in PEF-containing reactions and 79% in those lacking PEF. This 
slight drop in the dGTP pool is not anticipated to affect PGR product yield or DNA polymerase 
replication fidelity significantly. However, it is anticipated that the use of higher amounts of PEF in 
PGR (»1ng per 100/#l reaction) will be deleterious due to dCTP reactivity. If high amounts of PEF 
are used, it Is possible that the dGTP pool could fall below levels required for maximal yields and 
lowest misinsertion rates. As described elsewhere, we have observed inhibition of replication or 
amplification and/or smearing of products with the use of excessive amounts of PEF. 

5. PEF Reverses Inhibitory Action of dUTP In Amplification Reactions 

Although Lasken reported that the incorporation of dUTP in the nascent DNA strand only 
inhibited archael polymerases by 40% (Lasken, et al. (1996) J. Biol. Ghem. 271; 17692-17696), we 
unexpectedly found that the presence or addition of small amounts of dUTP into PGR reactions had 
more dramatic inhibitory consequences, as demonstrated below. 

A relatively small (0.9kb) fragment of the human ff1 -antitrypsin gene was amplified in the 
absence or presence of dUTP. PGR reaction mixtures contained the following (in a 100 ;/l volume): 
1x Gloned Pfu polymerase buffer; 200 //M each, dGTP, dGTP, dATP; 200 ng oligo F91-23 (100 
ng/A/l); 200 ng oligo R980-23 (100 ngZ/i/l); 125 ng Human genomic DNA; 2.5 units Pfu DNA 
polymerase; 200 //M total of (dTTP and dUTP) or (dTTP + PEF generated dUMP). 



-53- 



F91-23 5' GAGGAGAGCAGGAAAGGTGGAAC 3' (SEQ ID NO: 84) 

R980-23 5 ' CTCCATGTCCCAACTCCGATCAC 3 ' (SEQ ID NO: 85) 



PEF generated dUMP was prepared as described in Example 11, section 1, and purified by reverse 
phase HPLC. 

5 PGR cycling was earned out as follows: SS^'C for 1 minute (1 cycle); 95°C for 1 minute - 

58**C for 1 minute - 72^0 for 2 minutes (30 cycles). 

The PGR products were examined on a 1% agarose, 1x TBE gel as shown in Figure 21. The 
amplification of the 900 bp product from human genomic DNA was completely inhibited with dUTP 
concentrations as low as 2//M (1% of the dTTP+dUTP pool) and partially inhibited at 0.2//M (0.1%). 
10 The concentration of Individual nucleotides in a standard PGR reaction is about 200//M. dUTP 

inhibition can be completely prevented when PEF or the recombinant P45 protein (rP45) is added to 
PGR reactions containing dUTP. Seven ng of PEF can reverse the inhibition caused by as much as 
J 20 fjM dUTP (data not shown). The preferred amount of PEF or P45 used in a particular reaction 

can be optimized according to the principles provided here or by methods for quantifying 
);^5 amplification reactions known in the art. 

Unlike dUTP, the PEF generated byproduct, dUMP, was not inhibitory in Pfu polymerase- 
based PGR reactions, even when present at concentrations of 20 //M. In Figure 21, amplifications 
containing purified dUMP appear less robust than other bands on the gel. It should be noted that 
W the other PGR products on this gel were generated by Pfu in the presence of PEF or rP45, which 
|20 has been demonstrated to enhance PGR product yields as compared to amplifications conducted in 
^ the absence of PEF/rP45. These results are consistent with the enhancing activity of PEF and rP45 

being a result of dUTPase activity. The dUTPase activity may hydrolyze dUTP and thereby prevent 
the incorporation of dUTP into DNA. As demonstrated in Figure 21, dUTP incorporation by Pfu DNA 
polymerase during PGR can significantly decrease PGR product yields. Accordingly, the invention 
25 comprises a method of enhancing nucleic acid replication or amplification reactions by reducing the 
dUTP concentration or preventing the incorporation of dUTP into replicated or amplified products, as 
well as compositions that are capable of preventing that incorporation. 

We also tested whether PEF/rP45 could reverse the inhibition caused by uracil-containing 
DNA. PGR amplification was canied out in the presence of a third unrelated primer, which contains 
30 9 dUs instead of dTs (dU oligo). Primers complementary to M13 DNA were synthesized. 



-54- 



Control Oligo 5 ' ggttttcccagtcacgacgttgtaaaacgacggccagt 3' (SEQ ID NO: 86) 

dU OligO 5 ' GGUUUUCCCAGUCACGACGTOGUAAAACGACGGCCAGU 3' (SEQ ID NO: 87) 

The 900 bp a1 -antitrypsin fragment was amplified in the presence of the oligos, added at 
levels ranging from 200ng (16 pmole) to 0.2 ng (0.016 pmole) per lOOul reaction. In addition, 
5 similar reactions were performed with Taq DNA polymerase instead of Pfu DNA polymerase. In 
Figure 22, the control oligonucleotide shows inhibition of the Pfu-based PGR reaction at high 
concentrations (16 and 8 pmole). In contrast, inhibition by the dU oligo was readily observed at 
concentrations as low as 0.8 pmole. It seems likely that both oligonucleotides sequester Pfu DNA 
polymerase away from the PGR primers and genomic DNA, but the dU oligo can inhibit at lower 
10 concentrations. These results show that Pfu DNA polymerase may be more tightly bound to a dU 
containing substrate. With the addition of PEF, a small amount of amplification product is observed 
in reactions containing 0.8 pmole of dU containing primer. 

1 The enhancement by PEF in the dU oligo-inhibited reaction could be achieved through at 
least two possible pathways. The most likely explanation is that PEF is having no effect on the dU 

-15 containing oligonucleotides and is simply increasing the activity of Pfu DNA polymerase by 

2 scavenging dUTP generated during PGR by heat- or chemically-induced deamination of dGTP (eg., 
y lanes 2 and 3 of Figure 22). Or, PEF may work with Pfu DNA polymerase to remove the uracil 

^ moieties from the oligonucleotides through a repair pathway. Thus, PEF may also be used in repair 

S reactions employing appropriate polymerase activities. Repair reactions are known in the art and 
^0 methods to adopt the use of PEF into those reactions can be devised by those skilled in the art. 

3 The knowledge of potential PEF mechanisms of action described here allows those skilled in 
the art to employ other dUTP modifying enzymes in enhancing polymerase reactions. A definition or 
one of these other modifying enzymes can be an enzyme that diminishes the capacity to incorporate 
dUTP by polymerases or at least partially inhibits dUTP incorporation. Assays used to identify and 

25 characterize PEF as described herein can also show other dUTP modifying enzymes. These other 
modifying enzymes could also mimic the enhancing attributes of PEF or a particular protein, such as 
P45 or rP45, An example of this class of enzyme would be dUTP pyrophosphatases (EG 3.6.1.23), 
such as deoxyuridine 5'-triphosphate nucleotide hydrolase, as well as other enzymes involved in 
dUTP metabolism, catabolism, or synthesis. These other enzymes may be used alone or in 

30 combination with PEF or other proteins or enhancing additives. 

Furthermore, the presence of the consensus uridine-binding motif or the related sequences 
shown herein can also be used to define an enzyme or protein that is a PEF. Thus, a protein the 
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comprises SEQ ID NO.: 72, or any one of SEQ ID NOs.: 72-81 , or combinations of these sequences, 
may be a PEF according to this invention. 

Example 12 

Identification of Other Proteins for Enhancing Polymerase Activity 

5 The structural infomnation, in the amino acid and nucleotide sequences, as well as the 

functional infomnation described here allow one skilled in the art to identify polymerase enhancing 
and/or dUTPase activities from a variety of sources. For example, we have shown above how 
degenerate probes made from the amino acid sequences of P50 and P45 can be used to clone 
nucleotide sequences encoding polymerase enhancing and dUTPase activities, or PEF. Since we 
10 have identified the importance of dUTPase activity in controlling and enhancing polymerase 

reactions, such as PGR. structural infomnation available for any dUTPase can be put to a new and 
advantageous use in identifying and producing proteins for enhancing polymerization reactions. 
m Furthemnore, the assays described can be used to identify the presence of dUTPase activity from 
any source. 

y15 1. Cloning Human dUTPase as a Representative Eukaryotic Protein for Enhancing 

Polymerization Reactions 

To detemnine if other enzymes with dUTPase activity could also produce polymerase 

r| enhancing activity, we cloned a representative eukaryotic protein, human dUTPase. Total RNA was 

□ isolated from human placenta and converted to cDNA as follows: 5 //I total human RNA, 5 fj\ oligo 

rSgO dT (0.1 fjglfjl), 1 A/I Moloney murine leukemia virus reverse transcriptase (40 u///l), 1 /il 100 mM 

Q dNTPs, 5 //I lOx first strand buffer, 33 /il DEPC-treated water (where 1x first strand buffer is 50 mM 

Tris-HCL (pH 8.3), 75 mM KCI, 10 mM DTT, and 3 mM MgCy. The reaction was incubated at 37°C 

for one hour. A negative control was run in parallel without reverse transcriptase. 

Primers containing a sequence specific to the 5* and 3' termini of one of the human dUTPase 

25 genes were synthesized and are shown below. The accession numbers for the cDNA sequence of 

Human deoxyuridine triphosphatase (DUT) are gi|1421817[gb|U62891{HSU62891. These primers 

also shared sequence with the vector pCAL-n-EK (in bold print below), which allowed ligation 

independent cloning (LIC) of the amplified product, as described in Example 10. 

Primer 285 gacgacgacaagatgccctgctctgaagagacacc (SEQ ID NO.: 88) 

30 Primer 286 ggaacaagacccgtttaattctttccagtggaacc (SEQ ID NO.: 89) 
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Prior to PGR, the reverse transcriptase was heat inactivated by incubating the reaction at 80^C for 5 
minutes. The dUTPase sequence was amplified in a 100 //I reaction containing 1x cloned Pfu 
polymerase buffer, 200ng of each primer, 200 //M dNTPs, 2.5 units of Pfu DNA polymerase, 3 ng of 
PEF complex and 3//I of human placenta cDNA from the previous section. 

The reactions were amplified under the following conditions: 95^*0 for 3 minutes (1 cycle); 
95<»C for 1 minute - 50^C for 1 minute - 72°C for 2 minutes (30 cycles). The amplified reaction was 
examined on a 1% agarose gel to confimt that the product exhibited the correct size before 
purification. The purified product was cloned into the expression vector pCAL-n-EK, as described in 
Example 10, and transfomned into XL1-Blue cells. Three clones were confinned to contain human 
dUTPase by sequencing of the first 500 bases. After the transfomnants were shown to contain the 
dUTPase sequence by PGR amplification, their plasmids were harvested and used to transform the 
£ coli strain BL21 (DE3). 

2. Human dUTPase Expression and Activity Analysis 

The BL21/dUTPase clones were induced with IPTG and the expressed protein was purified 
by means of the calmodulin binding peptide (GBP) tag expressed as a fusion protein at the amino 
terminus of the dUTPase sequence. The fusion protein was purified on calmodulin agarose, as 
described in example 10. The protein products were analyzed by SDS-PAGE and found to be of 
the correct molecular weight. 

To confimn that the dUTPase clones were active, the Sigma pyrophosphatate assay (see 
Example 11) was utilized. The assay demonstrated that all of the clones tested could convert dUTP 
to dUMP + pyrophosphate. The human dUTPase enzyme was thennolabile and became completely 
inactive after a one minute pre-incubation at 70°G. 

Polymerase enhancement was also detected with the 5.2 kb on/off assay. The assay was 
modified from the protocol described in Example 1 to allow detection of the thermolabile PEF 
activity. A PGR cocktail was mixed to provide an identical starting point for all samples. Ninety-nine 
microliters of the cocktail was aliquoted into six thin-walled, 0.5 ml tubes. The reactions contained 
278 ng of hurnan genomic DNA, 200 ng of each primer (see Example 1), 200 //M each dNTP, 2.5 
units of Pfu DNA polymerase in 1x cloned Pfu polymerase buffer. At each 60**G annealing step, 0.5 
//I of one the following were added: human dUTPase preparation, a 1/1 0th dilution of the human 
dUTPase preparation, 2 ng///l rP45 (positive control), or dUTPase storage buffer (negative control). 
Both human dUTPase reactions were mn in duplicate. The samples were cycled as follows: QS^'G 
for 1 minute (1 cycle); 95°G for 1 minute - 60^G for 1 minute - 72°C for 5.2 minutes (30 cycles). 
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10 //I of each PGR reaction was visualized on a 1% agarose, 1x TBE gel by ethidium bromide 
staining. (See Figure 23) The undiluted human dUTPase preparation was able to enhance the 
polymerase activity to produce the 5.2 Kb band. As demonstrated by the negative control, lack of 
any enhancing factor results in a failed PGR. The positive control for this experiment, rP45, did not 
5 work in this experiment, presumably because an excessive quantity (30 ng total) was added. 

Example 13 

Production of Antibodies to PEF and Western Blot Analysis 

1. Production of anti-PEF and anti-rP45 IgG 

PEF-specific IgG was purified by immunoaffinity chromatography from the sera of rabbits 
10 previously immunized against a lot of native Pfu DNA polymerase containing PEF (see Figure 24 
showing gel of purified native Pfu polymerase preps). The S-200-purified Pfu PEF was covalently 
coupled to AffiGel 10 (BioRad: Hercules, GA) in the presence of 20mM HEPES, 1mM DTT, 50mM 
I J KGI, 0.05% Tween 20, 1 mM EDTA, and 10% glycerol, following the manufacturer's recommended 

protocol. Rabbit sera (2.4 ml) was loaded onto a 0.2 ml column in the presence of 10mM Tris (pH 
M^5 7.5). The column was washed extensively and the specific IgG was eluded with 0.1M glycine-HGl 

(pH 2.5) followed by 0.1 M triethylamine (pH 1 1.5). Using a Gentricon-30. the IgG was concentrated 
Id and the elution buffer replaced with PBS. 

1=^: In addition, sera containing rP45-specific IgG was obtained by immunizing rabbits with 

M recombinant P45, which was prepared as a tagged fusion protein, as described in example 10, 

;!^0 section 2. The purified enzyme (0.177 mg/ml) was used to immunize two New Zealand white rabbits 
Q using the following immunization schedule: 90 //g/rabbit in Complete Freund's Adjuvant (GFA); 18 

days later, boost with 45 /#g/rabbit in incomplete Freund's adjuvant (IFA); 39 days later, second 

boost; 45 days later, obtained serum sample for Westem blot. 

2. Western Blot Analysis Using anti-PEF Antibodies. 

25 Gell extracts were prepared by suspending cells in 4X 50mM Tris, pH 8.2, 10mM BME, ImM 

EDTA, and 10% glycerol, followed by sonication. Then, 2.5mM PMSF was added and the cellular 
debris removed by centrifugation for 15 minutes at 14,000 rpm. PEI was added to the supernatant 
to a final concentration of 0.9% and the mixture centrifuged again. The supematants (10 //I) were 
electrophoresed on 4-20% SDS-PAGE gels and the proteins transfen-ed to nitrocellulose by 

30 electroblotting. The blots were blocked with 1% Blotto/PBS for 1 hour at room temperature and then 
incubated with PEF-specific IgG overnight at 4^G. The blots were washed in PBS-,05% Tween 20, 
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and then incubated with alkaline phosphatase-conjugated goat anti-rabbit IgG. The blot was washed 
and then incubated in color development solution (100 mM Tris-HCI, pH 9.5, 100 mM NaCI, 5 mM 
MgClj, 0.3mg/ml NBT, and 0.15 mg/ml BCIP) for approximately 1-2 minutes. The enzyme reaction 
was stopped and the membrane was washed five times with deionized water. 
5 Figure 25 depicts the results of the Western Blot. Extracts were prepared from 5 different 

fermentations of P. furiosus (lanes 1-5). In addition, three extracts or partially purified column 
fractions from 7. aquaticus (lanes 6-8) and one extract from E coli (lane 9) were also mn. Purified 
PEF (550 ng) and pre-stained molecular weight markers were run in lanes 10 and 11, respectively. 
With the exception of the mariners, all samples were boiled in SDS/BME dye prior to loading. The 
10 results show PEF-specific IgG binds to and cross-reacts with components of the PEF complex in 
crude Pfu extracts, including the P50 and P45 components. In contrast, no cross-reaction was 
observed with extracts from 7, aquaticus or £. coli. 

Q 3. Western Blot Detection of Native PEF and Recombinant P45 Using antl-rP4S Sera 

Native PEF samples were electrophoresed on a 4-20% gradient Tris-Glycine SDS gel. The 
45 samples were loaded without denaturation (P300 form) or after partial (boiling in 2%SDS; P45 form) 
'I or complete (boiling 2%SDS plus 1%TCA) denaturation. The samples were transfen^ed to 
y nitrocellulose and the blots developed as described above, except that sera from rabbits immunized 

-I with recombinant P45 was used (diluted 1 :000). 

f In Figure 26, antibodies specific to recombinant P45 crossreact with bands in undenatured, 

Jo partially denatured, and fully denatured samples of native PEF. The predominant bands are the 
3 P300 aggregate (P45/P50) present in the unheated lane, the 45kD (partially denatured) fonn present 
in the boiled/SDS sample, and the 17kD (fully denatured) form present in the boiled/1% TCA/2% 
SDS lane, which presumably represents the fully denatured monomer. In addition to these different 
aggregation states of native P45, there are also minor bands present in the Western blot, which may 
25 represent additional forms of P45 or the cross-reaction of antibodies to Pyrococcus proteins sharing 
common epitopes with E coli contaminants present in the recombinant P45 preps. 

As with the PEF-specific IgG from above, anti-rP45 sera can also be used to identify 
immunochemically-related proteins from other species. In Figure 27, the rP45-specific IgG was used 
to probe a blot containing cellular extracts from bacteria {Bacillus species, T. aquaticus), archea {P. 
30 furiosus, M. therwoautotrophicum, and an uncharacterized species) and human cells (HL60 
lymphoma). The results obtained show the antibodies bind to proteins from P. furiosus, the 
uncharacterized species, and the human cell line, which migrate with the same apparent molecular 
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weight as purified native PEF. The antibodies also bind weakly with proteins in the Bacillus and M. 
thenvoautotropicum extracts, although these bands migrate with a different apparent molecular 
weight than P. furiosus PEF species. As with the PEF-specific IgG, no cross-reactivity was detected 
with T. aquaticus extracts. 

In a separate Western assay, samples from Thermus thermophilis were run out on an SDS- 
PAGE gel and probed with the rP45 antisera. The rP45 antibody binds to a band of approximately 
24kD (between markers of 16kD and 30kD) and another band approximately twice that size, 
presumable a dimeric or multimeric fonn. This demonstrates that PEF activity or protein is present 
in the T. thenvophiiis samples. The PEF as dUTPase activity in these samples was further 
confirmed by a dUTP conversion assay employing reverse phase HPLC. The assay detected the 
turn-over of dUTP with the T. thermophilis samples, but a control sample showed no detectable turn- 
over of the dUTP present. 

Example 14 

Use of PEF Complex in Nucleic Acid Replication Reactions 

Initially and as a control to confimi the activity of the DNA polymerase used, gapped-duplex 
calf thymus DNA (Phannacia) assays were performed. The polymerase cocktail contained 50mM 
Tris-HCI, pH 8.0, 5mM MgClj, 1mM DTT, 50Aig/ml BSA, 4% glycerol, 200//M each dNTP, [^H]TTP 
(.5mCi//miole final concentration), and 2£)0//g/ml of activated calf thymus DNA (Pharmacia). 
Samples containing Pfu DNA polymerases or P. furiosus PEF were serially diluted in Pfu DNA 
polymerase storage buffer (50mM Tris-HCI, pH 8.2, 0.1% NP-40, 0.1% Tween-20, O.ImM EDTA, 
ImM DTT, 50% glycerol) and then 1//I of each dilution was added to 10//I aliquots of polymerase 
cocktail. Polymerization reactions were conducted in triplicate for 30 minutes at 72''C. The 
extension reactions were quenched on ice, and then 5//I aliquots were spotted immediately onto 
DE81 filters (Whatman). Unincorporated [^H]TTP was removed by 6 washes with 2xSCC (0.3M 
NaCI, 30mM sodium citrate, pH 7.0), followed by one wash with 100% ethanol. Incorporated 
radioactivity was measured by scintillation counting. The assay was calibrated by counting a known 
amount of [^HJJTP on DE-81 filters, omitting the wash steps. One unit of polymerase activity is 
defined as the amount of enzyme which catalyzes the incorporation of 1 0 nmoles of total dNTP into 
polymeric form (binds to DE-81 paper) in 30 minutes at 72*'C. Polymerase concentrations (U/ml) 
were extrapolated from the slope of the linear portion of units vs. enzyme volume plots. 

The PEF samples tested exhibit no significant DNA polymerase activity while the Pfu DNA 
polymerase exhibited a specific activity of 2-4x10* u/mg. 
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enhancing activity assay. 



2, Screening Assays for PCR-Erihancing Activity Using Pfu-derived Samples 

A number of amplification assays can be designed to detect the presence or absence of PEF 
activity, and/or compare PEF activity betv^een samples. Generally, these tests employ a sample 
containing a rare sequence to be amplified. The sequence is so rare, or the conditions so designed, 
that amplification under normal situations results in barely detectable or no detectable amplified 
product. By adding a sample with putative PEF activity, any effects on the amount of amplified 
product fonned can be detected. 

One particular screening assay is called the "On/or assay, which detects the presence or 
absence of PEF. The "On/Off' assay results in appreciable amplified product only when PEF activity 
is present, or a detectable difference in amplified product when PEF activity is present compared to 
when PEF is not present. Methods for detecting the amount of amplified product are known in the 
art and include those using electrophoresis and hybridization. 

One embodiment of an assay used to screen for PEFs, in this case from Pyrococcus 
furiosus, involved amplifying a 6.2 kb target from transgenic mouse DNA (6.2 kb primer-template 
assay). A master PGR cocktail was prepared consisting of: 200a/M each dNTP, 2 /ig/ml primer 
F432-21 (5'-CTA-TTG-AGT-ACG-AAC-GCC-ATC)(SEQ ID NO.: 62), 2//g/ml primer R6656-20 
(GTC-ACG-CTT-GCT-CCA-CTC-CG)(SEQ ID NO.: 63), 2.5 //g/ml ylAA742 DNA (transgenic mouse 
DNA having 40 copies of a lambda phage DNA shuttle vector), 1x cloned Pfu DNA polymerase PGR 
buffer (10mM KGI, 10mM (NHJjSO^, 20mM Tris-HCI (pH 8.8), 2mM MgSO^, 0.1%(v/v) Triton X-100, 
and 100//g/ml BSA), and 25U/ml cloned Pfu DNA polymerase. Samples were diluted in 1x cloned 
Pfu DNA polymerase PGR buffer, and of each diluted sample was added to 24//I of the PGR 
cocktail. PGR amplifications were conducted on the RoboGycler 96 Temperature Gycler 
(Stratagene), using the following conditions: 96**G for 45s (1 cycle)/ 96''G for 45s; 60'G for 45s; 
72''G for 14 min. (35 cycles)/72''G for 10 min. (1 cycle). PGR products (10 //I/lane) are mn out on 
1% agarose gels and PGR product bands visualized by ethidium bromide staining. Samples with 
PGR enhancing, activity exhibit higher yields of the expected 6.2 kb PGR product than can be 
obtained in the presence of DNA polymerase alone. When PGR enhancement is due to the 
presence of PEFs, rather than contaminating endogenous DNA polymerase activity, amplifications 
performed in the absence of exogenous DNA polymerase (e.g. Pfu DNA polymerase) yield no PGR 
product. Moreover, PGR amplifications perfomied in the absence of exogenous DNA template 
should yield no PGR product when the PEF sample lacks contaminating target DNA. 
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1. Enhancement of Cloned Pfu DNA Polymerase with Pffii PEF 

P. furiosus PEF has been demonstrated to enhance the yield of PGR products generated 
with recombinant Pfu DNA polymerase using plasmid, lambda, and genomic DNA templates (Figures 
28-30). The results demonstrate that the addition of P. furiosus PEF increases PGR product yield 
for a variety of PGR systems, ranging in target complexity. Relatively easy targets, e.g. plasmid 
DNA, can be successfully amplified with F'fu DNA polymerase, and the addition of PEF further 
increases product yield (Figure 28). We have found that fewer PGR cycles or lower template 
concentrations can be used in PEF-containing reactions, demonstrating the advantageous properties 
of the proteins and compositions described and the methods employing them. 

The most dramatic enhancements were observed when long (Figure 29) and/or highly 
complex targets (Figure 30) are amplified. The presence of PEF unexpectedly and significantly 
improves the amplification of these targets, which are often poorly amplified by single enzyme PGR 
reactions. In Figure 29, the addition of 1-100 ng of PEF (S200-purlfied PEF; prep. 3) to 100//I PGR 
reactions containing 5U of Pfu DNA polymerase significantly increased yields of a 10kb PGR 
product. In Figure 30, a 5.2kb target was successfully amplified from human genomic DNA in the 
presence of .3-280ng PEF (SGS #52 S200 purified) per 100^/1 PGR, but not in the absence of PEF, 
despite the use of 1.9 min. per kb extension times. 

P. furiosus PEF has been found to enhance the yields of both specific and non-specific PGR 
products, when amplifications are conducted under less stringent PGR conditions with PGR primers 
that hybridize to more than one target. £ coli ssb (single-stranded binding protein; Stratagene's 
PerfectMatch) has been shown previously to increase the specificity of primer extension reactions, 
presumably by minimizing the fomnation of pooriy matched primer-template complexes. When used 
in conjunction with E. coli ssb, P. furiosus PEF has been found to enhance the yield of specific PGR 
products (Figure 31). Pfu PEF also enhances yields of PGR products obtained with exo-Pfu and a 
mixtures of Taq and Pfu polymerase (for example, TaqPlus Long™, Stratagene; La Jolla, GA). 
Therefore, Pfu PEF is useful with polymerase mutants, taincated versions of polymerases, mixtures 
of polymerases, and polymerase-additive combinations (for example. Perfect Match®, Stratagene). 

2. Enhancement of Native Pfu DNA Polymerase with Pfu PEF 

Subsequent to identifying PEF from P. furiosus, we recognized that certain lots of native Pfu 
DNA polymerase preparations contained PEF. Varying amounts of the >250kD aggregate could be 
detected on silver-stained SDS-PAGE gels (e.g., lots 38 and 46 in Figure 24). Eleven of the 23 
preparations examined were found to visibly contain low levels (8/11 lots; 0.1-1% total protein) to 
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high levels (3/11 lots; 10-30% total protein) of PEF. PEF co-migrates with Pfu DNA polymerase 
during the initial Q- and SP-Sepharose columns in Pfu DNA polymerase purification, and elutes just 
after the major peak of Pfu DNA polymerase activity on the Heparin Sepharose column (Figure 32). 
Pfu DNA polymerase fractions pooled after the Heparin Sepharose step are typically contaminated 
5 with varying amounts of PEF, depending upon the column efficiency and pooling strategy employed. 

Contamination of native Pfu DNA polymerase with varying amounts of PEF could potentially 
contribute to lot-to-lot variation in the peifomiance of native Pfu DNA polymerase in PGR. It is 
expected that lots containing approximately 1-100 ng of PEF per 2.5U of Pfu DNA polymerase will 
give rise to higher PGR product yields than amplifications conducted with cloned Pfu DNA 
10 polymerase or native Pfu DNA polymerase lots contaminated with < 10 pg per 2.5U Pfu DNA 
polymerase (<.02 % total protein). In theory, a lot containing certain PEF concentrations would 
exhibit reduced Pfu DNA polymerase peirfomiance, based upon the apparent inhibition of PEF at 
high concentrations discussed below (>900 ng per 2.5U Pfu DNA polymerase in IOOaiI PCRs). 

When adding PEF to native Pfu DNA polymerase PGR amplifications, it is anticipated that the 
15 level of PEF contained in a particular lot of native Pfu must be taken into account to avoid smearing, 
inhibition of synthesis, or sub-optimal enhancement. 

Example 15 
Use of PEFs in Amplification Reactions 
1. Activity of Pfu PEF in a Standard PGR Protocol 

20 To enhance PGR product yield, P. furiosus PEF is added, separately or pre-mixed with the 

DNA polymerase, to standard PGR amplifications. PGR amplification reactions generally consist of 
the following: .01-.05U/aiI DNA polymerase, 2.5 pg///l to 1 ng/fA P. furiosus PEF, 1-5 ng///l of each 
primer, .5-10 ngZ/A of genomic DNA template, and 0.2mM each dNTP in a suitable buffer (e.g., 
cloned Pfu DNA polymerase buffer consists of lOmM KGI, lOmM (NHJjSO^, 20mM Tris-HGI (pH 

25 8.8), 2mM MgSO^, 0.1%(v/v) Triton X-100, and lOOngZ/i/l BSA). Amplifications from less-complex 
templates (e.g.. lambda or plasmid DNA) are typically successful with 0.1-100 pg//wl DNA, The 
minimum amounts of native PEF typically used in PGR correspond to amounts of enzyme sufficient 
to catalyze the release of 0.1 to 0.4 nmole PPi/hr per 100//I PGR reaction (see example 11, section 
2). PGR amplifications are conducted under standard conditions; e.g., 94-96°G for 0.5-2 min. (1 

30 cycle)/ 94-96''G for 0.5-2 min.; 50-65°G for 0.5-2 min.; 68-72«G for 0.5-3 min. per kb of target 
amplified (30-40 cycles)/72''G for 0-10 min. (1 cycle). 
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2. Enhancement of PCR Amplification Reactions Employing DNA Polymerases 
Other than Pfu DNA Polymerase 

P. furiosus PEF has been found to enhance the perfomiance of other a-type (Family B- 

related) DNA polymerases from thermophilic archea. Enhanced PCR product yields were observed 

5 when P. furiosus PEF was added to amplifications conducted with DNA polymerases from both 

Pyrococcus and Thermococcus species. DNA polymerases demonstrated to function with P. 

furiosus PEF include: Pwo DNA polymerase (Boehringer Mannheim; cloned from P. woesei), Deep 

Vent DNA polymerase (New England Biolabs; cloned from P. sp. GBD), JDF3 DNA polymerase 

(Stratagene; cloned from P. sp. JDF3), ES4 DNA polymerase (Stratagene; purified from P. sp. ES4, 

10 renamed P. endeavori) and Vent DNA polymerase (New England Biolabs; cloned from T. litoralis). 

In Figure 33, S200-purified Pfu PEF (prep. 1) increased yields of a 6.2kb PCR product 

obtained with 2.5U of Pwo DNA polymerase. In Figure 34, the addition of S200-purified PEF (prep. 

3) to JDF3 DNA polymerase PCRs increased the yield of a 10kb product amplified from lambda DNA 

w (lanes 5-7; 11-13) and mouse genomic DNA containing 40 copies (lanes 8-10) or 1 copy (lanes 14- 

:=i5 16) of a lambda DNA transgene. In JDF3 DNA polymerase-based PCRs, amplifications are typically 

W conducted with 1U of enzyme and extension times of 0.5 min./kb target. In Figure 35, the addition 

jji of 5 ng of S200-purified PEF (prep. 3) to ES4, JDF3, Pfu, and Vent DNA polymerase PCRs 

increased the yield of a 1.9kb product amplified from P. furiosus genomic DNA. In Figure 36, the 

addition of P. furiosus PEF was also shown to increase yields of a 2kb PCR product amplified with 

;:|0 Deep Vent DNA polymerase from transgenic mouse genomic DNA. 

i;3 The addition of P. furiosus PEF may not enhance the yield of PCR products generated with 

Taq DNA polymerase (Figure 34, lanes 1-4 and Figure 23, lanes 9-10). Taq DNA polymerase is a 

Q Pol l-like (Family A-related) DNA polymerase isolated originally from the thennophilic eubacteria 
Thermus aquaticus. 

25 3. Enhancement of RT-PCR Reactions 

P. furiosus PEF has also been shown to enhance the yield of PCR products amplified from 
reverse transcribed RNA (cDNA) in a process known as RT-PCR, known in the art. Enhancement 
has been observed in both 2-step (Figure 37) and 1-tube RT-PCR protocols (data not shown). In 
the fomier procedure, aliquots of cDNA synthesis reactions are added to PCR reactions containing a 
30 thermostable DNA polymerase (e.g., Pfu DNA polymerase) and P. furiosus PEF. In the latter 
approach, RNA is added to reaction mixtures containing a thennolabile RT, dNTPs, primers, a 
thermostable DNA polymerase (Pfu DNA polymerase), and P. furiosus PEF. cDNA synthesis and 
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PGR amplification take place sequentially, in the same tube, by conducting cDNA synthesis at 37- 
Al'C, followed by PGR amplification at elevated temperatures. 

In the 2-step RT-PGR procedure, cDNA synthesis is first perfomied by combining the 
following reagents (50/#i final volume): 5fjg total RNA pre-annealed to 300 ng of primer (oligo dT, 
random hexamers, or a gene-specific primer), 4mM each dNTP, 20U RNase block (optional), and 
SOU MMLV RT (or other RT) in buffer containing 50mM Tris-HGI (pH 8.3), 75mM KGI, 3mM MgClz, 
and DEPC-treated water. cDNA synthesis is allowed to proceed for 1 hour at 37-42''C. After heat 
inactivation of the RT, 1//I of cDNA is added to a 50//i PGR reaction containing 5U Pfu DNA 
polymerase, .01-50 ng P. furiosus PEF. of each primer, and 0.2mM each dNTP in buffer 
consisting of lOmM KGI, lOmM (NHJjSO,, 20mM Tris-HGI (pH 8.8), 3mM MgSO^, 0.1%(v/v) Triton 
X-100, and 100//g/ml BSA. PGR can be conducted using the following cycle conditions: 94''C 2 
min.; 60»G 2 min.; 68"'G 1 min. per kb (1 cycle) and then 94»G 1 min.; SCG Imin.; 68°G Imin. per 
kb (40 cycles). 

The enhancement of RT-PGR with P. furiosus PEF was evaluated using PGR primers 
designed to span at least one intron-exon junction in the EGF receptor gene. Two primer sets were 
used (antisense: 5'GAG-TTA-AAT-GGG-TAG-AGT-GTA-TGT (SEQ ID NO.: 29); sense: 5*GAG- 
GAC-TCA-GAA-GCT-GCT-ATC-GAA (SIEQ ID NO.: 30)(1kb) or 5'CTG-GAG-GTG-GGG-TGT-AGG- 
ATT-TGT (SEQ ID NO.: 31)(3kb)), which generate PGR products of Ikb or 3kb, as indicated, when 
amplification occurs from spliced RNA rather than contaminating DNA. 

The PEF concentration which gives optimal perfomiance was detemiined by titrating PEF 
preparation 3 (S-200 purified) and preparation 4 (heparin sepharose fraction) in the 2-step RT-PGR 
procedure described here. With PEF preparation 4, significant increase in the yield of the Ikb 
product was observed when 0.001-1//I was added (10 pg-10 ng PEF) (Figure 37). Synthesis of the 
3kb product was significantly enhanced vi/hen 0.1-1//i (1-10 ng PEF) of preparation 4 was added. 
With PEF preparation 3, significant increases in the yields of both the 0.6kb and the 3kb products 
were observed for all amounts tested in the range of 0.002-0. 1//I (1-50 ng). 

4. Enhancement of Seamless™ Cloning Protocol 

Seamless Gloning was perfomied using Stratagene's Seamless™ Gloning kit (Stratagene; La 
Jolla, GA. 1997/1998 Stratagene Gatalog, specifically incorporated herein by reference), following 
the recommended protocol. The effect of P. furiosus PEF on the efficiency of Seamless Gloning is 
shown in Figure 38. Increased yield of a 7.2 kb PGR product was observed when 5 ng of S-200 
purified PEF (prep. 1) was added to 50fi\ PGR reactions containing 2.5U Pfu DNA polymerase and 
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methyl dCTP. Amplifications conducted in the presence of PEF utilized 1 min. per kb extension 
times. In the absence of PEF. very little PGR product was generated despite the use of longer 2 
min./kb extension times. 

5. Enhancement of Linear Amplification Reactions: QuIkChange™ K/lutagenesis 
Protocol 

Site-specific mutagenesis can be accomplished efficiently with double-stranded DNA 
templates using a linear amplification-based strategy employing Pfu DNA polymerase 
(QuikChange™ Site-Directed Mutagenesis Kit; Stratagene; La Jolla, CA, 1997/1998 Stratagene 
Catalog, specifically incorporated herein iDy reference). PGR primers containing the desired 
mutation(s) are designed to anneal to the same site on opposite strands. Primer extension reactions 
are conducted with a thennostable DNA polymerase (e.g. Pfu DNA polymerase) at temperatures 
which allow efficient synthesis in the absence of strand displacement activity (Sa^G). The 
amplification product is treated with Dpnl to digest the parental methylated plasmid DNA and the 
resulting gapped, double-stranded DNA is then transfonned into competent E. coli cells. Mutant 
clones are identified by DNA sequencing. 

In evaluating P. furiosus PEF, mutagenesis was conducted using Stratagene 's Quik Ghange 
mutagenesis kit, except that both recombinant and native Pfu DNA polymerase were used in the kit- 
supplied reaction buffer. The effect of P. furiosus PEF on the efficiency of QuikGhange mutagenesis 
is shown in Figure 39. The addition of .04 to 4 ng of PEF of PEF prep. 2 (heparin sepharose 
fraction lot 36 H.S. #78; 40ng///l) to 50/#l reactions increased the number of transfomiants generated 
by native and cloned Pfu DNA polymerases, while retaining mutation frequencies of 90-97%. 
Optimal results were obtained with 0.4 ng of PEF, which gave 7.5-fold and 5.3-fold increases in the 
number of mutant colonies generated witl^1 native and cloned Pfu DNA polymerase, respectively. 

The use of PEFs in the QuikGhange™ mutagenesis protocol con-esponds to the use of PEFs 
in other linear amplification reactions known in the art, such as cycle sequencing reactions, primer 
extension reactions, and the like. PEFs can be employed in any linear amplification method to 
enhance the activity of the polymerase used. For example, the effect of Pfu PEF on cycle 
sequencing can be evaluated by comparing the quality and length of sequencing ladders generated 
with a polymerase, for example exo Pfu DNA polymerase, in the absence and in the presence of 
PEF. A number of different cycle sequencing reactions, known to one skilled in the art, can be used 
in combination with the PEF complexes and proteins of this invention to enhance polymerase 
activity. In addition, primer extension readions can also be enhanced with the use of PEFs. 
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Numerous primer extension reactions are l<nown in the art. 

Example 16 
Enhancing Titer of PEF 

The nucleic acid replication enhancing activity of several different preparations of Pfu PEF 
have been evaluated in PGR, PCR-related applications, linear amplification-based applications, 
mutagenesis applications, cycle sequencing applications, and primer extension applications. One 
skilled in the art will appreciate that similar methods to optimize the use of any PEF, such as those 
specifically discussed herein, are apparent from the disclosure herein. A sample of substantially 
homogeneous PEF (e.g. S200-purified) enhances the performance of Pfu DNA polymerase in PGR 
amplification reactions when added at concentrations spanning a 10,000-fold range (0.09-900 
ng/100/[/l). The highest yields of amplified product are obsen/ed in the presence of «1 to 100 ng of 
P50. The addition of excess PEF (>900 ng/100//l, where protein concentration was determined by 
the silver-staining intensity of the P50 band as compared to known protein standards) or very low 
PEF concentrations ( < 9 pg/100//l) in a PGR reaction resulted in lower PGR product yield. 

The relative purity and PEF content of 4 preparations was examined by SDS-PAGE analysis 
(Figure 10). Preparations 1 and 3 consist of S200-purified PEF of >95% homogeneity, while 
preparations 2 and 4 consist of concentrated heparin sepharose fractions of 10-20% homogeneity. 
The PGR enhancing titer of S200-purified P. furiosus PEF (prep. 1; Figures 40 and 41) was 
detemnined using the F432-21/R6656-20/ilAA742 primer-template system described in example 1. 
This preparation is approximately 95% pure, contains <,001U///I DNA polymerase activity, and « 
225 ng/fA PEF. PGR enhancing activity was found to be optimal when .004-.4/il (.9-90 ng) of 
homogeneous P. fun'osus PEF was added to 100//I PGR reactions containing 2.5U of Pfu DNA 
polymerase. Reduced, but significant, PGR product yield was observed when 4//I (900 ng) or 
.0004/11 (.09 ng) of the S200-purified protein was added (Figure 40). In a second identical 
experiment, significant enhancement was noted when .004-4iul was added, and very little 
improvement was noted with .0004/il (Figure 41). 

The PGR enhancing titer of PEF preparation 2 was also determined (Figure 42-43). 
Preparation 2 consisted of a concentrated (Microcon-30) heparin sepharose fraction, in which P. 
furiosus PEF made up approximately 10% of the total protein and was present at approximately 40 
ng///l, Enhanced PGR product yield was observed when .0002-.2/jl (.008-8 ng) of the column 
fraction was added to 100//I PGR reactions containing 2.5U of Pfu DNA polymerase, with greatest 
enhancements observed in the range of .002-.2/fl (.08-8 ng) (Figure 42). In a second identical 
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experiment, enhanced PCR product yield was observed when .004-.04//I (.16-1.6 ng) of PEF 
preparation 2 was added to 2.5U of DNA polymerase per 100//I PCR reaction (Figure 43). No PCR 
product was observed in the presence of 4//I (160 ng) or .0004//I (.016 ng) of the column fraction, 
while a smear was generated when .4|il (16 ng) of the column fraction was added to PCR. Smeary 
PCR products were also noted previously when ^/A of heparin sepharose fractions containing the 
highest concentrations of PEF are added to PCRs (e.g., lot 37 H.S. fractions 69-81 in Figure 44; 
PEF prep. 2 in Figure 31). 

In summary, sunstantially homogeneous P. furiosus PEF enhances the performance of Pfu 
DNA polymerase in test PCR amplifications when added at concentrations spanning a 10,000-fold 
range (0.09-900 ng/100//l). The highest yields are observed in the presence of *1 to 100 ng of 
PEF. The addition of excess PEF (>900 ng/100//l) or PEF < 9 pg/100//l PCR reaction, was found to 
give reduced perfomiance (lower PCR product yield). Partially-purified PEF samples (heparin 
sepharose column fractions) also appear to enhance PCR product yield over a fairiy broad range of 
PEF concentrations. With the column frsiction analyzed here, highest yields of PCR were obtained 
in the range of .08 ng to 8 ng. The addition of higher amounts of the column fraction resulted in 
smearing (16-40 ng) or lack of enhancement (160 ng). 

Inhibition of PCR enhancement at high concentrations of PEF appears to occur irrespective 
of the purity of the PEF sample. The addition of higher concentrations of homogeneous PEF (>900 
ng) resulted in lower yields of PCR product than could be obtained with <900 ng PEF. Heparin 
sepharose fractions of 10-20% purity also gave reduced PCR product yields when high amounts of 
PEF were added. Up to 8 ng of PEF in prep. 2 (H.S. #78 fraction) could be added before smearing 
or inhibition occun-ed. The discrepancy t)etween the amount of PEF which is inhibitory in 
homogeneous preparations (>900 ng), as compared to partially-purified column fractions (>16 ng), 
suggests that additional protein or DNA contaminants may be present in the heparin sepharose 
fractions. 

Examination of heparin sepharose fractions revealed that P. furiosus PEF elutes just after the 
major peak of Pfu DNA polymerase activity (e.g., fractions 50-61 in Figure 32). SDS-PAGE analysis 
showed that the highest levels of the >250kD PEF appeared in fractions 63-78 from the native Pfu 
DNA polymerase purification (Figure 32). PCR enhancing activity was observed in fractions 37-90 
(Figure 44). Discrete PCR product bands were generated with fractions 37-48 and 87-90, which 
contain very low levels of PEF. Interestingly, DNA smears were generated with fractions 69-81 
(Figure 44), which contain the highest levels of PEF and no detectable Pfu DNA polymerase. These 
results are consistent with the hypothesis that P. furiosus PEF acts as an inhibitory substance when 
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present at high concentrations. However, concentrations high enough to adversely effect 
polymerization and PGR reactions can easily be detennined through the methods and assays 
described herein. Thus, one skilled in the art can avoid concentrations of PEF that inhibit 
polymerase activity or PGR reactions without undue experimentation. 
5 As obsen/ed with PGR, inhibition during linear amplification protocols was noted with high 

concentrations of PEF-containing heparin sepharose fractions (Figure 39). The addition of 40 ng 
{^fA prep. #2) of PEF to QuickGhange™ reactions resulted in reduced yield of amplification product, 
as visualized by a reduction in the ethidium bromide-staining intensity of DNA bands on agarose 
gels. Reduced yield accompanied a 1.8 to 2.8-foId reduction in the number of transformants and a 

10 slight, but reproducible, decrease in mutation frequency. 

A possible and the most likely explanation for inhibition by homogenous PEF preparations is 
depletion of dGTP. In Example 11, section 1, we demonstrated that PEF can utilize dCTP as a 
substrate, although much less efficiently than dUTP. At high PEF concentrations it is possible that 
enough dGTP is hydrolyzed by PEF to drop the dGTP levels below what is required for optimal DNA 

j16 synthesis. It is also possible that moderate to high levels of PEF could alter dGTP levels enough to 
affect DNA polymerase misincorporation rates. Alternatively, contaminants in the substantially 
homogenous PEF preparations may also cause the inhibition and may only be present in sufficient 
concentrations when high concentrations of PEF are used. 

Each of the references refen^d to herein can be relied on by one skilled in the art in making 
j20 and using embodiments of the invention. In addition, each reference is specifically incorporated, in 
its entirety, into this disclosure. 

The sequence listing information that follows incorporates the sequences in prior U.S. Patent 
application serial no. 08/822,744, which is specifically incorporated herein by reference. The 
sequence infonnation from any one sequence or any combination of sequences can be translated 
25 into a computer readable medium by those of skill in the art. Furthermore, the sequences of the 
specific clones or plasmids described or identified herein can be easily detemriined and used in a 
computer readable medium by one skilled in the art. 
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What is claimed is: 

1 . A non-naturally occurring composition of matter comprising at least one component 
possessing nucleic acid polymerase enhancing activity selected from the group consisting of: an 
isolated or purified naturally-occuning polymerase enhancing protein obtained from a bacterial, 
eukaryotic, or archael source; a wholly or partially synthetic protein having the same amino acid 
sequence as said naturally-occuning protein or analogs thereof possessing polymerase enhancing 
activity; polymerase-enhancing mixtures of one or more of said naturally occuning or wholly or 
partially synthetic proteins; polymerase-enhancing protein complexes of one or more of said naturally 
occuning or wholly or partially synthetic proteins; or polymerase enhancing partially purified cell 
extracts containing one or more of said naturally occuning proteins. 

2. A composition of matter according to claim 1, wherein said component possessing 
polymerase enhancing activity is a cell extract. 

3. A composition of matter according to claim 2, wherein said cell extract is from an 
archael source. 

4. A composition of matter according to claim 3, wherein said cell extract is from 
Pyrococcus furiosus. 

5. A composition of matter according to claim 1, wherein said component possessing 
polymerase enhancing activity is a protein complex. 

6. A composition of matter according to claim 5, wherein said protein complex is from an 
archael source. 

7. A composition of matter according to claim 6, wherein said protein complex is from 
Pyrococcus furiosus. 

8. A composition of matter according to claim 7, wherein said protein complex is P300. 

9. A composition of matter according to claim 5, wherein said protein complex comprises 
a plurality of subunits wherein at least one subunit has a molecular weight of approximately 17-18kD 
in the fully denatured, monomeric fomri. 

10. A composition of matter according to claim 9, wherein a subunit has a sequence of 
amino acids at the amino terminal end comprising SEQ ID NO: 46. 

11. A composition of matter according to claim 9, wherein a subunit has a sequence of 
amino acids comprising one of SEQ ID NO: 47 or 48. 

12. A composition of matter according to claim 10 or 11, further comprising a subunit 
encoded by a DNA having the nucleotide sequence of SEQ ID NO: 18, degenerate variants thereof, 
or sequencecs hybridizable thereto. 
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13. A composition of matter according to claim 1, wherein said component possessing 
polymerase enhancing activity is a protein. 

14. A composition of matter according to claim 13, wherein said protein is from an archael 

source. 

15. A composition of matter according to claim 14, wherein said protein is from 

Pyrococcus furiosus. 

16. A composition of matter according to claim 15, wherein said protein comprises at least 
one protein having a molecular weight of approximately 17-18kD in the fully denatured, monomeric 
fomn. 

17. A composition of matter according to claim 16, wherein said protein is selected from 
the group consisting of: a protein having a sequence of amino acids at or within about 20 amino 
acids from the amino tenninal end comprising one of SEQ ID NO: 69 or 11; a protein encoded by a 
nucleic acid having the sequence of SEQ ID NO: 70, degenerate variants thereof, or sequences 
hybridizable thereto; or a protein having a sequence of amino acids comprising SEQ ID NO: 71. 

18. A composition of matter according to claim 1, wherein said component possessing 
polymerase enhancing activity is a wholly or partially synthetic protein having the same amino acid 
sequence as said naturally-occuning protein or analogs thereof. 

19. A composition of matter according to claim 18, wherein said protein has a molecular 
weight of approximately 17-18kD in the fully denatured, monomeric fonn. 

20. A composition of matter according to claim 19, wherein said protein has a sequence 
of amino acids at or within 20 amino acids of the amino temiinal end comprising one of SEQ ID NO: 
11 or 69. 

21. A composition of matter according to claim 19, wherein said protein has a sequence 
of amino acids comprising SEQ ID NO: 71. 

22. A composition of matter according to claim 20 or 21, further comprising a subunit 
encoded by a DNA having the nucleotide sequence of SEQ ID NO: 70. 

23. A composition of matter according to claim 1 , wherein said component possessing 
polymerase enhancing activity is a mixture of proteins. 

24. An isolated or purified DNA comprising a sequence encoding a protein according to 
the protein of one of claims 16 or 17. 

25. An isolated or purified DNA having a sequence selected from the group consisting of: 
the sequence set forth in SEQ ID NO: 18, degenerate sequences thereof, or DNA sequences 
hybridizable thereto; the sequence set forth in SEQ ID NO: 70, degenerate sequences thereof, or 
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DNA sequences capable of hybridizing thereto, 

26. An isolated or purified DNA sequence capable of hybridizing to DNA sequence 
according to claim 24 under stringent conditions. 

27. An isolated or purified DNA sequence capable of hybridizing to a DNA sequence 
encoding a protein according to claim 20. 

28. A composition of matter comprising a polymerase-enhancing protein encoded by DNA 
according to claim 26. 

29. A composition of matter comprising a polymerase-enhancing protein encoded by DNA 
according to claim 27. 

30. A non-naturally occuning mixture of a polymerase-enhancing composition according 
to claim 1, with one or more DNA polymerases. 

31 . A mixture according to claim 30, wherein at least one of said polymerases is a 
themnostable DNA polymerase. 

32. A mixture according to claim 30, wherein at least one of said polymerases is derived 
from an archael source. 

33. A mixture according to claim 31, wherein at least one of said polymerases is a DNA 
polymerase derived from the Pyrococcus species or the Thermococcus species. 

34. A mixture according to claim 31 , wherein at least one of said polymerases is 
Pyrococcus fuhosus, Pyrococcus sp. JDF3, Pyrococcus sp. GBD, Pyrococcus sp. KOD, 
Thermococcus litoralis, or Pyrococcus woesii DNA polymerase. 

35. A kit for replicating nucleic acids comprising a polymerase-enhancing composition of 
claim 1 and at least one nucleic acid polymerase. 

36. A kit according to claim 35, containing at least one recombinant nucleic acid 
polymerase, 

37. A kit according to claim 35 or 36, capable of use in a site-directed mutagenesis 
method. 

38. A kit according to claim 35 or 36, capable of use in a nucleic acid sequencing method. 

39. .A kit according to claim 35 or 36, capable of use in an amplification reaction. 

40. A method of enhancing a nucleic acid polymerase reaction comprising, in any 
appropriate order: 

(a) mixing a nucleic acid sequence template for a nucleic acid polymerase with at 
least one nucleic acid polymerase; and 

(b) adding to (a) a polymerase enhancing composition according to claim 1. 
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41. A method according to claim 40, wherein said reaction is a replication reaction. 

42. A method according to claim 40, wherein said reaction comprises an amplification 
reaction. 

43. A method according to claim 40, wherein said reaction comprises a PGR process or 
RT-PCR process. 

44. A method according to claim 41 , 42, or 43, further comprising a site-directed 
mutagenesis process, a cycle sequencing process, or a cloning process. 

45. An antibody that binds to a composition of matter of claim 1 . 

46. An antibody that binds to a protein having an amino acid sequence comprising one of 
SEQ ID NO: 19 or 71. 

47. A method for purifying a polymerase-enhancing protein comprising; 

(a) solubilizing the protein from archae cells while substantially maintaining 
protein:protein interactions; 

(b) perfomning heparin sepharose chromatography on said sample; 

(c) perfomning size exclusion chromatography on the product of step (b); and 

(d) identifying a polymerase enhancing activity. 

48: A method for identifying the presence or absence of a composition of matter with 
polymerase enhancing activity comprising adding a protein extract from cells to a nucleic acid 
polymerization reaction and measuring the number of products produced in said polymerization 
reaction compared to the number of products in a control reaction. 

49. The method of claim 48, wherein the polymerase activity employed in said nucleic 
acid polymerization reaction comprises at least one of native or cloned bacterial DNA polymerase, 
native or cloned archael DNA polymerase, native of cloned polymerase from the Pyrococcus species 
or the Thermococcus species, Pyrococcus furiosus DNA polymerase, native or cloned reverse 
transcriptase, or native or cloned RNA polymerase. 

50. The method of claim 48, wherein the polymerization reaction comprises one of a PGR 
process or RT-PGR process. 

51. The method of claim 49, wherein the polymerization reaction comprises one of a PGR 
process or RT-PGR process. 

52. A method for identifying DNA encoding polymerase-enhancing activity in a sample 
comprising contacting a nucleic acid probe having all or a portion of the sequence of nucleotides 
from SEQ ID NO: 70, or a sequence of nucleotides encoding all or a portion of the amino acid 
sequence of SEQ ID NO: 71, to nucleic acids of the sample and isolating a nucleic acid capable of 
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hybridizing to said probe. 

53. A method as claimed in claim 52, wherein a hybridization condition is employed 
comprising a low stringency wash in a solution comprising approximately .45 M NaCI. approximately 
.045 M trisodium citrate, and approximately .1% SDS, and wherein the wash temperature is 
approximately 37^ to approximately 42**C. 

54. A method as claimed in claim 53, wherein a hybridization buffer is employed 
comprising approximately .75M NaCI, approximately .075 M trisodium citrate, and approximately 
50% fonnamide, and wherein a hybridization wash is employed comprising approximately .1 M 
phosphate, approximately .1 x SET, approximately .1% sodium pyrophosphate, and approximately 
.1% SDS at approximately 45'*C. 

55. A method for identifying DNA encoding polymerase-enhancing activity in a sample 
comprising perfomiing an amplification reaction with at least one primer capable of hybridizing to a 
DNA encoding a polymerase-enhancing activity or protein component thereof. 

56. A method as claimed in claim 55, wherein at least one primer comprises an at least 
15 nucleotide portion of the DNA of SEQ ID NO: 70, or a DNA comprising a sequence encoding an 
at least 5 amino acid portion of SEQ ID NO: 71, or a DNA comprising one of claims 25-27. 

57. A method for identifying polymerase-enhancing activity in a sample comprising 
contacting an antibody of claim 45 or 46 with said sample and detecting protein binding to said 
antibody. 

58. A DNA construct comprising a sequence encoding PEF protein P45 operably linked to 
an expression vector. 

59. A P45 protein produced from a cell containing a DNA constnjct as claimed in claim 
58, wherein the protein is in monomeric, dimeric, or multimeric form. 

60. A P45 protein as claimed in claim 59, wherein the cell is a bacterial cell, 

61. A PEF complex comprising a P45 protein as claimed in claim 59. 

62. An antibody that binds to a P45 protein as claimed in claim 59. 

63. An antibody that binds to a PEF complex as claimed in claim 61. 

64. A P45 protein produced from a cell containing a DNA constmct as claimed in claim 
58, wherein the P45 protein is produced as a fusion protein. 

65. A P45 protein as claimed in claim 64, wherein the fusion protein comprises a 
calmodulin binding peptide. 

66. A P45 protein as claimed in claim 65, wherein the expression vector is 
pCAL-n-EK. 
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67. A kit for replicating nucleic acids comprising at least one polymerase, a P45 protein as 
claimed in claim 59, and reagents for perfomiing a polymerization reaction. 

68. A kit as claimed in claim 67, wherein the P45 protein is present in a PEF complex. 

69. A method of enhancing a nucleic acid polymerase reaction comprising adding a P45 
protein as claimed in claim 59 to a polymerization reaction. 

70. A method of enhancing a nucleic acid polymerase reaction as claimed in claim 69, 
wherein the P45 protein is present in a PEF complex. 

71. A method of enhancing a nucleic acid polymerase reaction comprising performing the 
reaction in the presence one or more of the following: a PEF; a dUTPase activity; a protein that 
tums-over dUTP; a protein having one or more of SEQ ID NO.: 72-81. 

72. A method for controlling the activity of a polymerase in a polymerization reaction, 
comprising changing the amount of dUTP present or generated during the reaction by adding a PEF 
activity. 

73. A method as claimed in claim 71, wherein the dUTPase activity comprises one or 
more of a P45 protein as claimed in claim 59, a human dUTPase, a bacterial dUTPase, an archael 
dUTPase, a yeast dUTPase, a mammalian dUTPase, or an animal dUTPase. 

74. A method as claimed in claim 72, wherein the PEF activity comprises a P45 protein as 
claimed in claim 59. 

75. A method for detecting the presence or absence of PEF activity in a sample 
comprising adding the sample to a nucleic acid polymerase reaction containing dUTP or dCTP and 
monitoring the inorganic pyrophosphate levels. 

76. A method for detecting the presence or absence of PEF activity in a sample 
comprising adding the sample to a nucleic acid polymerase reaction containing dUTP and monitoring 
any change in polymerization levels. 

77. A non-naturally occuning composition of matter comprising a P45 protein. 

78. A composition of matter as claimed in claim 77, wherein the P45 protein is in 
monomeric, dimeric, or multimeric fomi. 

79. A composition of matter as claimed in claim 77, wherein the P45 protein is present in 
a protein complex. 

80. A composition of matter as claimed in claim 77, wherein the P45 protein is an analog 
P45 protein. 

81. A method of producing a P45 protein comprising transfening a DNA constmct as 
claimed in claim 58 into a host cell and expressing the P45 protein. 



-74- 



82. A method as claimed in claim 81, wherein the P45 protein is expressed as a fusion 

protein. 

83. A method of producing a PIEF analog protein comprising introducing at least one 
mutation into the sequence encoding P45 protein of the DNA construct of claim 58 or into a 
sequence encoding a dUTPase protein, transfening the sequence to a host cell, and expressing the 
PEF analog protein. 

84. A DNA encoding a PEF activity comprising one or more of SEQ ID NO,: 32-35, 82, 
83, or 70, a sequence capable of hybridizing to one or more those sequences under stringent 
conditions, or degenerate variants of either. 

85. A protein having PEF activity comprising one or more of SEQ ID NO.: 72-81. 

86. A method for cloning a PEF activity comprising employing one or more nucleic acids 
comprising one or more of SEQ ID NO.: 32-35, 82, 83, or all or a portion of 70, and identifying a 
clone containing a sequence that hybridizes to the one or more nucleic acids. 

87. A PGR enhancing, protein extract comprising purified proteins from Thermus 
thermophilis that possesses dUTPase activity 

88. A composition comprising a protein extract as claimed in claim 87. 

89. A composition comprising a protein extract as claimed in claim 87, further comprising 
a themiostable DNA polymerase. 

90. A protein extract of claim 87, which comprises a protein that can be bound by an 
antibody specific for recombinant Pfu P45 protein, 

91. A composition comprising a protein extract of claim 90 and a thermostable DNA 
polymerase. 

91. A protein extract of claim 87, which comprises a protein that possesses a molecular 
weight of approximately 24kD in an SDS-PAGE gel. 

92. A protein extract of claim 90, wherein the protein possesses a molecular weight of 
approximately 24kD in an SDS-PAGE gei. 

93. A computer readable medium having stored in it the full or partial amino acid or DNA 
sequence information of a PEF protein. 

94. A computer-based method of screening for a PEF, comprising providing a computer 
readable medium as claimed in claim 93, and identifying other sequences in a database that 
possess sequence homology, similarity, or identity to all or a portion of the sequence stored in the 
computer readable medium. 
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ABSTRACT 

The invention provides novel extracts, proteins, and complexes that improve the 
polymerization activity of nucleic acid polymerases. Included within the aspects of the invention are 
methods for identifying compositions with a polymerase enhancing activity, methods for purifying and 
using these compositions, and specific extracts, proteins, and complexes that function to enhance 
polymerase activity As an example, specifically described is nucleotide and amino acid sequence 
infonnation for a Pyrococcus furiousus PE:F (P45), which was used to produce a recombinant PER 
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( I ) GENERAL INFORMATION: 

(i) APPLICANT: HOGREFE, Holly 

HANSEN, Connie J 

(ii) TITLE OF INVENTION: Polymerase Enhancing Factor (PEF) 

Extracts, PEF Protein Complexes, Isolated PEF Proteins, 
and Methods for Purifying and Identifying Them 

(ill) NUMBER OF SEQUENCES: 89 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: David J. Kulik, Evenson, McKeown, Edwards & 
Lenahan P.L.L.C. 

(B) STREET: 1200 G Street, NW Suite 700 

(C) CITY: Washington 

(D) STATE: DC 
(F) ZIP: 20005 

(v) COMPUTER READABLE FOFLM: 

(A) MEDIUM TYPE: Floppy diisk 

(B) COMPUTER: IBM PC com]3atible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentin Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION D^^iTA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 24-OCT-1997 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: KULIK, David J 

(B) REGISTRATION NUMBEPL: 36,576 

(C) REFERENCE/DOCKET NUMBER: 1486/43 163cp 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 202 628-8800 

(B) TELEFAX: 202 628-8844 



(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(V) FRAGMENT TYPE: N-terminal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

Xaa Xaa Leu His His Val Lys Leu lie Tyr Ala Thr Xaa Xaa Xaa 
15 10 15 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(V) FRAGMENT TYPE: N-terminal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Xaa Xaa Xaa Pro Asp Trp Xaa Xaa Arg Xaa Giu Xaa Leu Xaa Xaa 
15 10 15 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 



(iii) HYPOTHETICAL: NO 



(iv) ANTI-SENSE: NO 
(V) FRAGMENT TYPE: N- terminal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Xaa Leu Leu His His Val Lys Leu He Tyr Ala Thr Lys Xaa Arg Xaa 
15 10 15 

Leu Val Gly Lys Xaa He Val Leu Ala He Pro Gly Xaa Xaa Ala Xaa 
20 25 30 

Xaa Xaa Xaa 
35 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(V) FRAGMENT TYPE: N-terminal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Xaa Xaa Xaa Pro Asp Trp Xaa Xaa Arg Xaa Glu Xaa Leu Xaa Glu Xaa 
15 10 15 

Xaa Xaa 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



(V) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Xaa Tyr Asp Ala Val lie Met Ala Ala Ala Val Val Asp Phe Arg Pro 
IS 10 15 

Lys 

) INFORMATION FOR SEQ ID NO: 6: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(V) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Ala Asp Leu Val Val Gly Asn Thr Leu Glu Ala Phe Gly Ser Glu Glu 
15 10 15 

Asn Gin Val Val Leu He Gly Arg 
20 

) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: N-terminal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Gly Ala Met Leu His His Val Lys Leu He Tyr Ala Xaa Lys Leu Arg 
15 10 15 



Lys 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: N-terminal 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:8: 

Gly Ala Met Leu His His Val Lys Leu lie Tyr Ala Thr Lys Xaa Xaa 
15 10 15 

Arg Lys 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(V) FRAGMENT TYPE: N-terminal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Met Leu Hii His Val Lys Leu lie Tyr Ala Thr Lys Leu 
15 10 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(V) FRAGMENT TYPE: N-terminal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Gly Xaa Xaa Xaa Pro Asp Trp Xaa Xaa Lys Phe Arg Lys Glu Giu Ser 
15 10 15 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(V) FRAGMENT TYPE: N-terminal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Gly Ala He Leu Leu Pro Asp Trp Lys He Arg Lys Glu He Leu He 
15 10 15 

Glu 

(2) INF0RI4ATI0N FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(V) FRAGMENT TYPE: N-terminal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Xaa Met His His Val He Lys Leu Xaa Tyr Ala Thr Xaa Ser Arg Lys 
15 10 15 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(V) FRAGMENT TYPE: N-terminal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Met Leu Tyr Leu Val Arg Pro Asp Trp Lys Arg Arg Lys Glu He Leu 
15 10 15 

He Glu 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

CAYCAYGAHA ARYTHATTTA CGC 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 base pairs 



(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15 
GCCATDATNA CDGCRTCGTA TTT 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16 
CAYCAYGAHA ARYTHATATA CGC 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17 
ARDACDACYT GRTTTTCTTC 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1209 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

ATGCTTCACC ACGTCAAGCT AATCTACGCC ACAAiUVAGTC GAAAGCTAGT TGGAAAAAAG 60 

ATAGTCNNNN NNNNNCCAGG GAGTATTGCG GCTTTGGATG TGAAAGCTTG TGAGGGACTA 120 

ATTAGGCATG GGGCCGAAGT TCATGCAGTG ATGAGTGAGG CAGCCACCAA GATAATTCAT 180 

CCTTATGCAT GGAATTTGCC CACGGGAAAT CCAGTCATAA CTGAGATCAC TGGATTTATC 240 

GAGCATGTTG AGTTAGCAGG GGAACATGAG AATA;iAGCAG ATTTAATTTT GGTTTGTCCT 300 

GCCACTGCCA ACACAATTAG TAAGATTGCA TGTGGAATAG ATGATACTCC AGTAACTACA 360 

GTCGTGACCA CAGCATTTCC CCACATTCCA ATTAI^GATAG CCCCAGCAAT GCATGAGACA 420 

ATGTACAGGC ATCCCATAGT AAGGGAGAAC ATTG^iAAGGT TAAAGAAGCT TGGCGTTGAG 480 

TTTATAGGAC CAAGAATTGA GGAGGGAAAG GCAA^iAGTTG CAAGCATTGA TGAAATAGTT 540 

TACAGAGTTA TTAAAAAGCT CCACAAAAAA ACATI^GGAAG GGAAGAGAGT CCTAGTAACG 600 

GCGGGAGCAA CAAGAGAGTA CATAGATCCA ATAACJATTCA TAACAAATGC CAGCAGTGGA 660 

AAAATGGGAG TAGCGTTGGC TGAAGAAGCA GATTI^TAGAG GAGCTGTTAC CCTCATAAGA 720 

ACAAAGGGAA GTGTAAAGGC TTTTAGAATC AGAA/iAATCA AATTGAAGGT TGAGACAGTG 780 

GAAGAAATGC TTTCAGCGAT TGAAAATGAG TTGAG^GAGTA AAAAGTATGA CGTAGTTATT 840 

ATGGCAGCTG CTGTAAGCGA TTTTAGGCCA AAAATTAAAG CAGAGGGAAA AATTAAAAGC 900 

GGAAGATCAA TAACGATAGA GCTCGTTCCN NNNAJlTCCCA AAATCATTGA TAGAATAAAG 960 

GAAATTCAAC CAAATGTCTT TCTTGTTGGA TTTAi^iAGCAG AAACTTCAAA AGAAAAGCTT 1020 

ATAGAAGAAG GTAAAAGGCA GATTGAGAGG GCCAJlGGCTG ACTTAGTCGT TGGTAACACA 1080 

TTGGAAGCCT TTGGAAGCGA GGAAAACCAA GTAGl'ATTAA TTGGCAGAGA TTTCACAAAA 1140 



GAACTTCCAA AAATGAAAAA GAGAGAGTTA GCAGAGAGAA TTTGGGATGA GATAGAGAAA 1200 



TTNCTGTCC 

(2) INFORMATION FOR SEQ ID NO: 19; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 403 amino acids 

(B) TYPE: amino acid 

{ D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Met Leu His His Val Lys Leu He Tyr Ala Thr Lys Ser Arg Lys Leu 
1 5 10 15 

Val Gly Lys Lys He Val Xaa Xaa Xaa Pro Gly Ser He Ala Ala Leu 

20 25 30 

Asp Val Lys Ala Cys Glu Gly Leu He Arg His Gly Ala Glu Val His 
35 40 45 

Ala Val Met Ser Glu Ala Ala Thr Lys He He His Pro Tyr Ala Trp 
50 55 60 

Asn Leu Pro Thr Gly Asn Pro Val He Thr Glu He Thr Gly Phe He 
65 70 75 80 

Glu His Val Glu Leu Ala Gly Glu His Glu Asn Lys Ala Asp Leu He 
85 90 95 

Leu Val Cys Pro Ala Thr Ala Asn Thr He Ser Lys He Ala Cys Gly 
100 105 110 

He Asp Asp Thr Pro Val Thr Thr Val Val Thr Thr Ala Phe Pro His 
115 120 125 

He Pro He Met He Ala Pro Ala Met His Glu Thr Met Tyr Arg His 
130 135 140 

Pro He Val Arg Glu Asn He Glu Arg Leu Lys Lys Leu Gly Val Glu 
145 ' 150 155 160 

Phe He Gly Pro Arg He Glu Glu Gly Arg Ala Lys Val Ala Ser He 
165 170 175 

Asp Glu He Val Tyr Arg Val He Lys Lys Leu His Lys Lys Thr Leu 
180 185 190 

Glu Gly Lys Arg Val Leu Val Thr Ala Gly Ala Thr Arg Glu Tyr He 



195 



200 



205 



Asp Pro lie Arg Phe He Thr Asn Ala Ser Ser Gly Lys Met Gly Val 
210 215 220 

Ala Leu Ala Glu Glu Ala Asp Phe Arg Gly Ala Val Thr Leu He Arg 
225 230 235 240 

Thr Lys Gly Ser Val Lys Ala Phe Arg He Arg Lys He Lys Leu Lys 
245 250 255 

Val Glu Thr Val Glu Glu Met Leu Ser Ala He Glu Asn Glu Leu Arg 
260 265 270 

Ser Lys Lys Tyr Asp Val Val He Met Ala Ala Ala Val Ser Asp Phe 
275 280 285 

Arg Pro Lys He Lys Ala Glu Gly Lys He Lys Ser Gly Arg Ser He 
290 295 300 

Thr He Glu Leu Val Pro Xaa Asn Pro Lys He He Asp Arg He Lys 
305 310 315 320 

Glu He Gin Pro Asn Val Phe Leu Val Gly Phe Lys Ala Glu Thr Ser 
325 330 335 

Lys Glu Lys Leu He Glu Glu Gly Lys Arg Gin He Glu Arg Ala Lys 
340 345 350 

Ala Asp Leu Val Val Gly Asn Thr Leu Glu Ala Phe Gly Ser Glu Glu 
355 360 365 

Asn Gin Val Val Leu He Gly Arg Asp Phe Thr Lys Glu Leu Pro Lys 
370 375 380 

Met Lys Lys Arg Glu Leu Ala Glu Axg He Trp Asp Glu He Glu Lys 
385 390 395 400 



Xaa Leu Ser 



) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unJcnown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20 
CATAGCGAAT TCGCAAAACC TTTCGCGGTA TGG 
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21 
ACTACGGAAT TCCACGGAAA ATGCCGCTCA TCC 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

{ C ) STRANDEDNESS : single 
( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22 
GGCGTTTCCG TTCTTCTTCG 
(2) INFOR24ATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 



(iv) ANTI-SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23 
CCATCTCACG CGCCAGTTTC 
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24 
GAGGAGAGCA GGAAAGGTGG AAC 
(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25 
GCTGGGAGAA GACTTCACTG G 
(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
GAGCTTGCTC AACTTTATC 
(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 
GATAGAGATA GTTTCTGGAG ACG 
(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

CGGGATATCG ACATTTCTGC ACC 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 base pairs 



(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29 
GAGTTAAATG CCTACACTGT ATCT 
(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30 
CAGGACTCAG AAGCTGCTAT CGAA 
(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31 
CTGCACGTGC CCTGTAGGAT TTGT 



(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32 
CCAGAYTGGA ARWKNAGGAA AGA 
(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33 
CCAGAYTGGA ARWKNAGAAA AGA 



(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 



(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
CCAGAYTGGA ARWKNAGGAA GGA 
(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
CCAGAYTGGA ARWKNAGAAA GGA 
(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

CAGAGTGGGC AGAGAGGCTN TTGTTAAGGG GAA/iTTAATC GACGTGGAAA 
AGGAAGGAAA 60 

AGTCGNTATT CCTCCAAGGG AATA 

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 

( D ) TOPOLOGY : unknown 



(ii) MOLECULE TYPE: peptide 



(iii) HYPOTHETICAL: YES 
(iv) ANTI-SENSE: NO 
(V) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

Glu Trp Ala Glu Arg Leu Leu Leu Arg Gly Asn Xaa Ser Lys Trp Lys 
15 10 15 

Arg Lys Glu Lys Ser Xaa Phe Leu Gin Gly Asn 
20 25 

(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: NO 

(V) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

Arg Val Gly Arg Glu Ala Xaa Val Lys Gly Lys Leu He Glu Val Glu 
15 10 i5 

Lys Glu Gly Lys Val Xaa He Pro Pro Arg Glu 
20 25 

(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 
(D) .TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: YES 

(iv) ANT I -SENSE: NO 

(V) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:39: 

Gin Ser Gly Gin Arg Gly Xaa Cys Xaa Gly Glu He Asn Arg Ser Gly 
15 10 15 

Lys Gly Arg Lys Ser Arg Tyr Ser Ser Lys Gly Leu 
20 25 



(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 129 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 
CTGCCCACTC TGAGGTCATA ACCTGCTGGT TGGACiCCATT CTTCAGAAAA TGGCTCTATA 60 
AGTATTTCTT TTCTGATTTT CCAGTCTGGA AGTAC^CATTT TACCACCGAA ACCTTTATTT 120 
TTAATTTAA 129 
(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(V) FRAGMENT TYPE: N-terminal 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:41: 

Xaa He Lys Asn Lys Gly Phe Gly Gly Lys Met Leu Leu Pro Asp Trp 

1 5 10 15 . 



Lys He Arg Lys Glu lie Leu lie Glu Pro Phe Ser Giu Glu Trp Leu 
20 25 30 



Gin Pro Ala Gly Tyr Asp Leu Arg Val Gly 
35 40 

(2) INFORMATION FOR SEQ ID NO: 42: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 740 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

TCCTCCAAGG GAATACGCCT TAATCCTAAC CCTCGAGAGG ATAAAGTTGC CCGACGATGT 60 

TATGGGGGAT ATGAAGATAA GGAGCAGTTT AGCAAGAGAA GGGGTTATTG GTTCTTTTGC 120 

TTGGGTTGAC CCAGGATGGG ATGGAAACTT AACACTAATG CTCTACAATG CCTCAAATGA 180 

ACCTGTCGAA TTAAGATATG GAGAGAGATT TGTGCAGATC GCATTTATAA GGCTAGAGGG 240 

TCCGGCAAGA AACCCTTACA GAGGAAACTA TCAGGGGAGC ACAAGGTTAG CGTTTTCAAA 300 

GAGAAAGAAA CTCTAGCGTC TTTTCAATAG CATCCTCAAT ATCTCGTGTG AAGTAATCAA 360 

TGTAAATACT TGCTGGGTGG GTTTTTAGGG ATTCAAACTC GTAAGATGGG CCTGTATAGC 420 

AGAAAACTAT TTTTGCCTCT TCTTCATTTA TCTTTCTGTG AATAAAAAAT CCAACATCCA 480 

CACTAGTTCC AAAAGATATT GTTTGCGTGA TTACCAACAA GATCTTGGCA TTATTTTTGA 540 

TCTTATACTC TATTCTCCTT TCTCCCTCCA ATTTGCCCAA AATAAACCTG GGTAGTATAC 600 

ATTCACTCCT CTCTTTTAAA TTCCTATAAA TTCGTACATA GTTTAGAAAA ATGTCAAATT 660 

CTTTNTTCCC TGTTAAATTA ACCNCNAAAT CTTTATNANN AANCTTTTTA TAATTCCCAA 720 

AACCCCTAAT TTTCCCCTTN 740 
(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 246 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 



(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: YES 
(iv) ANTI-SENSE: NO 
(V) FRAGMENT TYPE: N-terminal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 

Leu Gin Giy Asn Thr Pro Xaa Ser Xaa Pro Ser Arg Gly Xaa Ser Cys 
15 10 15 

Pro Thr Met Leu Trp Gly lie Xaa Arg Xaa Gly Ala Val Xaa Gin Glu 
20 25 30 

Lys Giy Leu Leu Val Leu Leu Leu Gly Leu Thr Gin Asp Gly Met Glu 
35 40 45 

Thr Xaa His Xaa Cys Ser Thr Met Pro Gin Met Asn Leu Ser Asn Xaa 
50 55 60 

Asp Met Glu Arg Asp Leu Cys Arg Ser His Leu Xaa Gly Xaa Arg Val 
65 70 75 80 

Arg Gin Glu Thr Leu Thr Glu Glu Thr He Arg Gly Ala Gin Gly Xaa 
85 90 95 

Arg Phe Gin Arg Glu Arg Asn Ser Ser Val Phe Ser He Ala Ser Ser 
100 105 110 

He Ser Arg Val Lys Xaa Ser Met Xaa He Leu Ala Gly Trp Val Phe 
115 120 125 

Arg Asp Ser Asn Ser Xaa Asp Gly Pro Val Xaa Gin Lys Thr He Phe 
130 135 140 

Ala Ser Ser Ser Phe He Phe Leu Xaa He Lys Asn Pro Thr Ser Thr 
145 150 155 160 

Leu Val Pro Lys Asp He Val Cys Val He Thr Asn Lys He Leu Ala 
165 170 175 

Leu Phe Leu" He Leu Tyr Ser He Leu Leu Ser Pro Ser Asn Leu Pro 
180 185 190 

Lys He Asn Leu Gly Ser He His Ser Leu Leu Ser Phe Lys Phe Leu 
195 200 205 

Xaa He Arg Thr Xaa Phe Arg Lys Met Ser Asn Ser Xaa Phe Pro Val 
210 215 220 



Lys Leu Thr Xaa Lys Ser Leu Xaa Xaa Xaa Phe Leu Xaa Phe Pro Lys 
225 230 235 240 



Pro Leu He Phe Pro Xaa 
245 

INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 246 amino acids 

(B) TYPE: amino acid 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

Pro Pro Arg Giu Tyr Ala Leu lie Leu Thr Leu Glu Arg He Lys Leu 
15 10 15 

Pro Asn Asn Val Met Gly Asp Met Lys He Arg Ser Ser Leu Ala Arg 
20 25 30 

Glu Gly Val He Gly Ser Phe Ala Trp Val Asp Pro Gly Trp Asp Gly 
35 40 45 

Asn Leu Thr Leu Met Leu Tyr Asn Ala Ser Asn Glu Pro Val Glu Leu 
50 55 60 

Arg Tyr Gly Glu Arg Phe Val Gin He Ala Phe He Arg Leu Glu Gly 
65 70 75 80 

Pro Ala Arg Asn Pro Tyr Arg Gly Asn Tyr Gin Gly Ser Thr Arg Leu 
85 90 95 

Ala Phe Ser Lys Arg Lys Lys Leu Xaa Arg Leu Phe Asn Ser He Leu 
100 105 110 

Asn He Ser Cys Glu Val He Asn Val Asn Thr Cys Trp Val Gly Phe 
115 120 125 

Xaa Gly Phe Lys Leu Val Arg Trp Ala Cys He Ala Glu Asn Tyr Phe 
130 _ 135 140 

Cys Leu Phe Phe He Tyr Leu Ser Val Asn Lys Lys Ser Asn He His 
145 150 155 160 

Thr Ser Ser Lys Arg Tyr Cys Leu Arg Asp Tyr Gin Gin Asp Leu Gly 
165 170 175 

He He Phe Asp Leu He Leu Tyr Ser Pro Phe Ser Leu Gin Phe Ala 
ISO 185 190 



Gin Asn Lys Pro Gly Xaa Tyr Thr Phe Thr Pro Leu Phe Xaa lie Pro 
X95 200 205 

lie Asn Ser Tyr He Val Xaa Lys Asm Val Lys Phe Phe Xaa Pro Cys 
210 215 220 

Xaa lie Asn Xaa Xaa He Phe Xaa Xaa Xaa Leu Phe He He Pro Lys 

225 230 235 240 



Thr Pro Asn Phe Pro Leu 
245 



INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 246 amino acids 

(B) TYPE: amino acid 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: YES 



(xi) SEQUENCE DESCRIPTION; SEQ ID NO:45: 

ser ser Lys Gly He Arg Leu Asn Pro Asn Pro Arg Glu Asp Lys Val 
1 5 10 15 



Ala Arg Arg Cys Tyr Gly Gly Tyr Glu Asp Lys Glu Gin Phe Ser Lys 
20 25 30 

Arg Arg Gly Tyr Trp Phe Phe Cys Leu Gly Xaa Pro Arg Met Gly Trp 
35 40 45 

Lys Leu Asn Thr Asn Ala Leu Gin Cys Leu Lys Xaa Thr Cys Arg He 
50 55 50 

Lys He Trp Arg Glu He Cys Ala Asp Arg He Tyr Lys Ala Arg Gly 

65 70 75 

ser Gly Lys Lys Pro Leu Gin Arg Lys Leu Ser Gly Glu His Lys Val 

ser val Phe Lys Glu Lys Glu Thr Leu Ala Ser Phe Gin Xaa His Pro 
100 105 

Gin Tyr Leu Val Xaa Ser Asn Gin Cys Lys Tyr Leu Leu Gly Gly Phe 
115 120 125 

Leu Gly He Gin Thr Arg Lys Met Gly Leu Tyr Ser Arg Lys Leu Phe 
130 135 

Leu His Leu Ser Phe Cys Glu Xaa Lys He Gin His Pro 



Leu Pro Leu Leu His Leu aer ru« v-^=. 
145 



150 155 



His Xaa Phe Gin 



His Tyr Phe Xaa 
180 

Pro Lys Xaa Thr 
195 

Tyr Lys Phe Val 
210 

Leu Asn Xaa Pro 
225 



Lys lie Leu Phe 
165 

Ser Tyr Thr Leu 



Trp Val Val Tyr 
200 

His Ser Leu Giu 
215 

Xaa Asn Leu Tyr 

230 



Ala Xaa Leu Pro 
170 

Phe Ser Phe Leu 

185 

lie His Ser Ser 



Lys Cys Gin lie 
220 

Xaa Xaa Xaa Phe 
235 



Thr Arg Ser Trp 
175 

Pro Pro lie Cys 
190 

Leu Leu Asn Ser 

205 

Leu Xaa Ser Leu 



Tyr Asn Ser Gin 
240 



Asn Pro Xaa Phe Ser Pro 
245 



(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

( D ) TOPOLOGY : u nknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(V) FRAGMENT TYPE: N-terminal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 

Met Leu His His val Lys Leu He Tyr Ala Thr Lys Ser Arg Lys Leu 
15 10 15 

Val Gly Lys Lys He Val Xaa Xaa Xaa Pro Gly Ser He Ala Ala 
20 25 30 

(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUEliCE CHARACTERISTICS: 

(A) " LENGTH: 17 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 



(iii) HYPOTHETICAL: NO 



(iv) ANTI-SENSE: NO 
(V) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

Lys Tyr Asp Val Val He Met Ala Ala Ala Val Ser Asp Phe Arg Phe 
15 10 15 

Lys 



INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 
(V) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

Ala Asp Leu Val Val Gly Asn Thr Leu Giu Ala Phe Gly Ser Glu Glu 

1.5 10 15 

Asn Gin Val Val Leu He Gly Arg 

20 

INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 



CTATTGAGTA CGAACGCCAT C 

(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 
GTCACGCTTG CTCCACTCCG 
(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 437 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Methanococcus Jannaschii 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

Met He Ser Glu He Met His Pro Thr Lys Leu Leu Lys Gly Thr Lys 
15 10 15 

Ser Lys Leu Leu Glu Asn Lys Lys He Leu Val Ala Val Thr Ser Ser 

- " 20 25 30 

He Ala Ala He Glu Thr Pro Lys Leu Met Arg Glu Leu He Arg His 
35 40 45 

Gly Ala Glu Val Tyr Cys He He Thr Glu Glu Thr Lys Lys He He 
50 55 60 

Gly Lys Glu Ala Leu Lys Phe Gly Cys Gly Asn Glu Val Tyr Glu Glu 



65 



70 



75 



80 



lie Thr Gly Xaa Xaa Xaa Xaa Xaa Asp lie Glu His lie Leu Leu Tyr 
85 90 95 

Xaa Xaa Xaa Xaa Asn Glu Cys Asp Cys Leu Leu lie Tyr Pro Ala Thr 
100 105 110 

Ala Asn lie lie Ser Lys lie Asn Leu Gly lie Ala Asp Asn lie Val 
115 120 125 

Asn Thr Thr Ala Leu Met Phe Phe Gly Asn Lys Pro lie Phe lie Val 
130 135 140 

Pro Ala Met His Glu Asn Met Phe Asn Xaa Xaa Ala lie Lys Arg His 
145 150 155 160 

lie Asp Lys Leu Lys Glu Lys Asp Lys He Tyr He He Ser Pro Lys 
165 170 175 

Phe Glu Glu Xaa Xaa Xaa Xaa Xaa Xaa Gly Lys Ala Lys Val Ala Asn 
180 185 190 

He Glu Asp Val Val Lys Ala Val He Glu Lys He Gly Asn Asn Leu 
195 200 205 

Lys Lys Glu Gly Asn Arg Val Leu He Leu Asn Gly Gly Thr Val Glu 
210 215 220 

Phe He Asp Lys Val Arg Val He Ser Asn Leu Ser Ser Gly Lys Met 
225 230 235 240 

Gly Val Ala Leu Ala Glu Ala Phe Cys Lys Glu Gly Phe Tyr Val Glu 
245 250 255 

Val He Thr Ala Met Gly Leu Glu Pro Pro Tyr Tyr He Lys Asn His 
260 265 270 

Lys Val Leu Thr Ala Lys Glu Met Leu Asn Lys Ala He Glu Xaa Xaa 
275 280 285 



Leu Xaa Ala Lys Asp Phe Asp He He He Ser Ser Ala Ala He Ser 
290 295 300 

Asp Phe Thr Val Glu Ser Xaa Phe Glu Gly Lys Leu Ser Ser Glu Glu 
305 310 315 320 

Glu Xaa xka Xaa Xaa Leu He Leu Lys Leu Lys Arg Xaa Asn Pro Lys. 

325 330 335 

Val Leu Glu Glu Leu Arg Arg He Tyr Lys Asp Xaa Lys Val He He 
340 345 350 



Gly Phe Lys Ala Glu Tyr Asn Leu Asp Glu Lys Glu Leu He Asn Arg 
355 360 365 



Ala Lys Glu Arg Leu Asn Lys Tyr Asn Leu Asn Met lie lie Ala Asn 
370 375 380 



Asp Leu Ser Lys Xaa Xaa His Tyr Phe Gly Asp Asp Tyr lie Glu Val 
385 390 395 400 

Tyr lie lie Thr Lys Tyr Glu Val Glu Lys lie Ser Gly Ser Lys Lys 
405 410 415 

Xaa Glu lie Ser Glu Arg lie Val Glu Lys Val Lys Lys Leu Val Lys 
420 425 430 

Ser Xaa Xaa Xaa Xaa 
435 

) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCK CHARACTERISTICS: 

(A) LENGTH: 444 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

Met Lys Ala Arg Gin Gin Lys Tyr Cys Asp Lys lie Ala Asn Phe Trp 
15 10 15 

Cys His Pro Thr Gly Lys lie lie Met Ser Leu Ala Gly Lys Lys lie 
20 25 30 

Val Leu Gly Val Ser Gly Gly lie Ala Ala Tyr Lys Thr Pro Glu Leu 
35 40 45 

Val Arg Arg Leu Arg Asp Arg Gly Ala Asp Val Arg Val Ala Met Thr 
50 55 60 

Glu Ala Ala Lys Ala Phe lie Thr Pro Leu Ser Leu Gin Ala Val Ser 
65 70 75 80 

Gly Tyr Pro Val Ser Asp Ser Leu Leu Asp Pro Ala Ala Glu Ala Ala 
85 90 95 

Met Gly His He Glu Leu Gly Xaa Xaa Xaa Xaa Lys Trp Ala Asp Leu 
100 105 110 



Val He Leu Ala Pro Ala Thr Ala Asp Leu He Ala Arg Val Ala Ala 



115 



120 



125 



Gly Met Ala Asn Asp Leu Val Ser Thr He Cys Leu Ala Thr Pro Xaa 
130 135 140 

Xaa Ala Pro Val Ala Val Leu Pro Ala Met Asn Gin Gin Met Tyr Arg 
145 150 155 160 

Ala Ala Ala Thr Gin His Asn Leu Glu Val Leu Ala Xaa Ser Arg Gly 
165 170 175 

Leu Leu He Trp Gly Pro Asp Ser Gly Ser Gin Ala Cys Gly Asp He 
lao 185 190 

Gly Pro Gly Arg Xaa Xaa Asp Pro Leu Thr He Val Asp Met Ala Val 
195 200 205 

Ala His Phe Ser Pro Val Asn Asp Leu Lys His Leu Asn He Met He 
210 215 220 

Thr Ala Gly Pro Thr Arg Glu Pro Leu Asp Pro Val Arg Tyr He Ser 
225 230 235 240 

Asn His Ser Ser Gly Lys Met Gly Phe Ala He Ala Ala Ala Ala Ala 
245 250 255 

Arg Arg Gly Ala Asn Val Thr Leu Val Ser Gly Pro Val Ser Leu Pro 
260 265 270 

Thr Pro Pro Phe Val Lys Arg Val Asp Val Met Thr Ala Leu Glu Met 
275 280 285 

Glu Ala Ala Val Asn Xaa Xaa Ala Ser Val Gin Gin Gin Asn He Phe 
290 295 300 

He Gly Cys Ala Ala Val Ala Asp Tyr Arg Ala Ala Thr Val Ala Pro 
305 310 315 320 

Glu Lys He Lys Lys Gin Ala Thr Gin Gly Asp Glu Leu Thr He Lys 
325 330 335 

Met Val Lys Xaa Asn Pro Asp He Val Ala Gly Val Ala Ala Leu Lys 
340 345 350 

Asp His Arg Pro Tyr Val Val Gly Phe Ala Ala Glu Thr Asn Asn Xaa 
355 360 365 

Xaa Xaa "Xaa Val Glu Glu Tyr Ala Arg Gin Lys Arg He Arg Lys Asn 
370 375 380 

Leu Asp Leu He Cys Ala Asn Asp Val Ser Gin Pro Thr Gin Gly Phe 
385 390 395 400 

Asn Ser Asp Asn Asn Ala Leu Hisj Leu Phe Trp Gin Asp Gly Asp Lys 



405 



410 



415 



Val Leu Pro Leu Glu Arg Lys Glu Leu Leu Gly Gin Leu Leu Leu Asp 
420 425 430 



Glu He Val Thr Arg Tyr Asp Glu Lys Asn Arg Arg 
435 440 

(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: YES 
(iv) ANTI-SENSE: NO 
(V) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 

Xaa Gly Xaa Xaa Asp Xaa Xaa Xaa Xaa Gly Xaa Xaa Xaa Xaa 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 
(V) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

Phe Ala Trp Val Asp Pro Gly Trp Asp Gly Asn Thr Leu Met 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

( D ) TOPOLOGY : unknown 



(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO. 

(iv) ANTI-SENSE: NO 

(V) FRAGKENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

Ala Gly Trp lie Asp Ala Gly Phe Lys Gly Lys lie Thr Leu 
15 10 

(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

Ser Ala Val His Asp Pro Gly Tyr Glu Gly Arg Pro Glu Tyr 
15 10 

(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANT I -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

Pro Thr He Val Asp Ala Gly Phe Glu Gly Gin Leu Thr He 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 58: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:5a: 

Ala His Arg He Asp Pro Gly Trp Ser Gly Cys He Val Leu 
15 10 

(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
{ D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 
GAGTTAAATG CCTACACTGT ATCT 
(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS; single 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 



CAGGACTCAG AAGCTGCTAT CGAA 

(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 

CTGCACGTGC CCTGTAGGAT TTGT 

(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 
CTATTGAGTA CGAACGCCAT C 
(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63 



GTCACGCTTG CTCCACTCCG 
(2) INFOR14ATI0N FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
{ D ) TOPOLOGY : u nknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 
GAGGAGAGCA GGAAAGGTGG AAC 
(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65 
GCTGGGAGAA GACTTCACTG G 
(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 
{ D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(V) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 



His His Val Lys Leu lie Tyr Ala 
1 5 

(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 
(V) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:67: 

Lys Tyr Asp Ala Val lie Met Ala 
1 5 

(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(V) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68 

Glu Glu Asn Gin Val Val Leu 

1 ' ' 5 

(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

( D ) TOPOLOGY : unknown 



(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 
(V) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:69: 



Pro Asp Trp Lys lie Arg Lys Glu 
1 5 



(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 471 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 

ATGCTACTTC CAGACTGGAA AATCAGAAAA GAAATACTTA TAGAGCCATT TTCTGAAGAA 60 

TCGCTCCAAC CAGCAGGTTA TGACCTCAGA GTGGXSCAGAG AGGCTTTTGT TAAGGGGAAA 120 

TTAATCGACG TGGAAAAGGA AGGAAAAGTC GTT^.TTCCTC CAAGGGAATA CGCCTTAATC 180 

CTAACCCTCG AGAGGATAAA GTTGCCCGAC GATGTTATGG GGGATATGAA GATAAGGAGC 240 

AGTTTAGCAA GAGAAGGGGT TATTGGTTCT TTTC^CTTGGG TTGACCCAGG ATGGGATGGA 300 

AACTTAACAC TAATGCTCTA CAATGCCTCA AATCJAACCTG TCGAATTAAG ATATGGAGAG 360 

AGATTTGTGC AGATCGCATT TATAAGGCTA GAGCK5TCCGG CAAGAAACCC TTACAGAGGA 420 

AACTATCAGG GGAGCACAAG GTTAGCGTTT TCAiUVGAGAA AGAAACTCTA G 471 

(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 156 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

( D ) TOPOLOGY : unknown 



(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 

Met Leu Leu Pro Asp Trp Lys lie Arg Lys Glu He Leu He Glu Pro 

15 10 15 

Phe ser Glu Glu Ser Leu Gin Pro Ala Gly Tyr Asp Leu Arg Val Gly 



20 



25 30 



Arg Glu Ala Phe Val Lys Gly Lys Leu He Asp Val Glu Lys Glu Gly 
35 40 45 

Lys val val He Pro Pro Arg Glu Tyr Ala Leu He Leu Thr Leu Glu 



50 



55 60 



Arg He Lys Leu Pro Asp Asp Val Met Gly Asp Met Lys He Arg Ser 



80 

65 



70 75 



ser Leu Ala Arg Glu Gly Val He Gly Ser Phe Ala Trp Val Asp Pro 

90 



85 



Gly Trp ASP Gly Asn Leu Thr Leu Met Leu Tyr Asn Ala Ser Asn Glu 



100 



105 



pro val Glu Leu Arg Tyr Gly Glu Arg Phe Val Gin lie Ala Phe He 
115 120 125 

Arg Leu Glu Gly Pro Ala Arg Asn Pro Tyr Arg Gly Asn Tyr Gin Gly 
130 135 140 

ser Thr Arg Leu Ala Phe Ser Lys Arg Lys Lys Leu 



145 



150 



155 



2) INFORMATION FOR SEQ ID NO: 72: 



(2) 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: YES 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:72: 



Xaa Gly Xaa Xaa Asp Xaa Xaa Xaa Gly Xaa Xaa Xaa Xaa 
1 5 10 



INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 
(V) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 

Phe Ala Trp Val Asp Pro Gly Trp Asp Gly Asn Thr Leu Met 
15 10 

INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 
(V) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 

Ala Gly Trp He Asp Ala Gly Phe Lys Gly Lys He Thr Leu 
15 10 

information' FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: peptide 



(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 
(V) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:75: 

Ser Ala Vai His Asp Pro Gly Tyr Glu Gly Arg Pro Glu Tyr 
15 10 

INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 
(V) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 

Pro Thr He Val Asp Ala Gly Phe Glu Gly Gin Leu Thr He 
15 10 

INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 
(V) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 

Ala His Arg He Asp Pro Gly Trp Ser Gly Cys He Val Leu 

15 10 



(2) INFORMATION FOR SEQ ID NO:78: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 

Val Gly Leu He Asp Ser Asp Tyr Gin Gly Gin Leu Met 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 
{ D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(V) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 

Ala Gly val Val Asp Arg Asp Tyr Thr Gly Glu Val Lys 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 80: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 



(iv) ANTI-SENSE: NO 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 

Ala Gly Val He Asp Glu Asp Tyr Arg Giy Asn Val Gly Val 

15 10 • 

(2) INFORMATION FOR SEQ ID NO: 81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 
(V) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 

Thr Gly Leu He Asp Pro Gly Phe Gin Gly Glu Leu Lys Leu 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 82: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 
GACGACGACA AGATGCTACT TCCAGACTGG AAA 
(2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 



(C) STRANDEDNESS: single 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 
GGAACAAGAC CCGTCCCACT TTCACAGATG AAGAG 
(2) INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 



GAGGAGAGCA GGAAAGGTGG AAC 



(2) INFORMATION FOR SEQ ID NO: 85: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 



CTCCATGTCC CAACTCCGAT CAC 



(2) INFORMATION FOR SEQ ID NO: 86: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 38 base pairs 



(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
{ D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 
GGTTTTCCCA GTCACGACGT TGTAAAACGA CGGCCAGT 
(2) INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 
GGUUUUCCCA GUCACGACGU UGUAAAACGA CGGCCAGU 
(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANT I -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88 
GACGACGACA AGATGCCCTG CTCTGAAGAG ACACC 



(2) INFORMATION FOR SEQ ID NO: 89: 
(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89 
GGAACAAGAC CCGTTTAATT CTTTCCAGTG GAACC 
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